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ABSTRACT 



DESIGN AND IMPLEMENTATION 
OF 
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A FREE CONSTITUENT ORDER LANGUAGE 
Dilek Zeynep Hakkani 
M.S. in Computer Engineering and Information Science 
Principal Advisor: Asst. Prof. Kemal Oflazer 
Co-advisor: Asst. Prof. Ilyas ^igekli 
July, 1996 



This thesis describes a tactical generator for Turkish, a free constituent or- 
der language, in which the order of the constituents may change according to 
the information structure of the sentences to be generated. In the absence of 
any information regarding the information structure of a sentence (i.e., topic, 
focus, background, etc.), the constituents of the sentence obey a default order, 
but the order is almost freely changeable, depending on the constraints of the 
text flow or discourse. We have used a recursively structured finite state ma- 
chine for handling the changes in constituent order, implemented as a right- 
linear grammar backbone. Our implementation environment is the GenKit sys- 
tem, developed at Carnegie Mellon University-Center for Machine Translation. 
Morphological realization has been implemented using an external morphological 
analysis/generation component which performs concrete morpheme selection and 
handles morphographemic processes. 



Key words: Natural Language Generation, Free Constituent Order Language, 
Realization, Grammar Theory. 
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Bu tezde, gerceklestirimi serbest oge diizenine sahip bir dil olan Tiirkce icin 
bir yiizeysel liretici sunulmaktadir. Bir ciimlenin bilgi yapisiyla ilgili herhangi 
bir bilginin (baslik, odak, arka plan, v.b.g.) olmamasi durumunda, tiimce ogeleri 
ongoriilen bir siraya uyarlar. Ancak bu sira, tiimcenin akisi veya konusmanm 
simrlamalarma gore serbetce degisebilir. Ogelerin sirasmdaki degisiklikleri liret- 
mek icin sag dogrusal gramer ile gerceklestirilmis bir 6z yinelemeli sonlu durum 
makinesi kuUamlmistir. Gerceklestirme ortamimiz, Carnegie Mellon Universitesi 
- Center for Machine Translation'da (CMU - CMT) gelistirilen GenKit sistemidir. 
Bicimbirimsel gerceklestirme, disaridan cagirilan, somut bicimbirim secimini ve 
bicimbirimsel degismeleri saglayan bicimbirimsel bir liretim sistemi kuUanilarak 
gerceklestirilmistir. 



Anahtar sozciikler: Dogal Dil Uretimi, Serbest Oge Sirali Diller, Gerceklestirme, 
Gramer Teorisi. 
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Chapter 1 
Introduction 



Natural Language Processing (NLP) is a research area which investigates com- 
putational systems that analyze, understand, process, and produce natural lan- 
guage. Every NLP system has one or both of the following subsystems: 

• Parser: a component which analyzes natural language sentences, and con- 
verts them into representations that can further be processed by the com- 
puter. 

• Generator: a component which produces natural language sentences from 
computer internal representations. 

Some applications of NLP systems are: machine translation systems, interfaces 
to database systems, speech understanding and production systems, and text 
skimming systems. In a machine translation system, the computer analyzes a 
given text in one language (called the source language), and then produces the 
translation of this text in another language (called the target language). The 
production of text in the target language is done by a natural language generation 
(NLG) system. In transfer-based machine translation, the generation system is 
a tactical generator. NLG systems also produce the results of the queries in a 
natural language in NLP interfaces to database, and the summaries of analyzed 
text in text skimming systems. 
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As a component of a large-scale project on natural language processing for 
Turkish, we have undertaken the development of a tactical generator. This tacti- 
cal generator can be used in machine translation applications, or in other natural 
language generation systems together with a strategic generator. 

We currently plan to use this tactical generator in prototype transfer-based 
human-assisted machine translation system from English to Turkish. The anal- 
ysis of the English sentences is done by an English analysis component. The 
transfer component transfers the output of the English analyzer, a representa- 
tion for an English sentence, into a representation for a Turkish sentence. The 
tactical generator, then, generates the surface form of the Turkish sentence, which 
is the translation of the input sentence into Turkish. An outline of this system 
is given in Eigure 1.1. Eor example, if the English sentence "The man wanted to 
read the book" is given as input to the English analysis component, it produces 
the following case-frame representation:^ 



S-FORM finite 
CLAUSE-TYPE predicative 



VOICE 

SPEECH- ACT 



ARGUMENTS 



active 
declarative 

ROOT want 
TENSE past 
AGR 3sg 



ROOT man 
DEFINITE -I- 

S-FORM inf 
CLAUSE-TYPE predicative 



VOICE 

SPEECH- ACT 
VERB 



ARGUMENTS 



active 
declarative 

ROOT read 



ROOT man 

DEFINITE -I- 

ROOT book 

DEFINITE -I- 



^We use the case-frame representation as a computer internal representation. A case-frame 
is a common representation for capturing the predication, arguments, and adjuncts involved in 
a sentence. We give the details of our case-frames in Chapter 4. 
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English Sentence 
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English Case-frame 


Transfer 






Turkish Case-frame 

1 



Turkish 
Tactical 



Generator 



V 

Turkish Sentence 

Figure 1.1: The outline of the machine translation project. 
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As can be seen above, this case-frame has the features, VERB, to capture 
the predication, and ARGUMENTS, to capture the arguments. The object of the 
sentence above is also a sentential clause, so the value of the OBJECT feature is a 
structure very similar to the case-frame. 

The transfer component converts this case-frame into another case-frame rep- 
resenting a Turkish sentence, which is the following case-frame: 



((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "iste") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr 

((number singular) 
(person 3))))))) 
(dir-obj 
( (roles 

((role act) 
(arg 

((s-form inf-ind-act) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
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((root "oku") 
(sense positive))) 
(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "kitap"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite +)))))))))))))))))) 



Then the tactical generator generates the Turkish sentence "Adam kitabi okumak 
istedi." from this case-frame. During this generation process, it uses a Turkish 
grammar and lexicon, and imposes the right word order and generates the relevant 
morphological features. 

Turkish, our target language, can be considered as a subject-object-verb (SOV) 
language, in which constituents can change order rather freely, at certain phrase 
levels, depending on the constraints of text flow or discourse. This constituent or- 
der freeness comes from the fact that the morphology of Turkish enables morpho- 
logical markings on the constituents to express their grammatical roles without 
relying on their order. 

To develop a tactical generator for Turkish, we have used a recursively struc- 
tured finite state machine, which handles constituent order changes. As the sur- 
face constituent order is almost freely changeable depending on the constraints of 
the text fiow or discourse, these constraints obtained from the information struc- 
tures in the case-frames of the sentences to be generated guide the generator to 
emit the proper word order. In the absence of any information structure, the 
constituents of the sentence obey a default order. 
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Our implementation environment is the GenKit generation system [22], de- 
veloped at Carnegie Mellon University-Center for Machine Translation. Mor- 
phological realization has been implemented using an external morphological 
analysis/generation component which performs concrete morpheme selection and 
handles morphographemic processes. 



1.1 Overview of the Thesis 



The outline of the thesis is as follows: Chapter 2 describes briefly Natural Lan- 
guage Generation, and its phases (text planning, sentence planning, and real- 
ization), and the scope of our work. Chapter 3 presents an overview of Turkish 
syntax, emphasizing the concepts that we dealt with when designing the gram- 
mar. Chapter 4 describes our approach for generating Turkish sentences, together 
with the architecture of our grammar. We also provide here a comparison of our 
work with related work on Turkish. Chapter 5 presents an evaluation of our 
grammar with some example outputs of the generator, along with proposals for 
future work. Chapter 6 concludes the thesis. 



Chapter 2 



Natural Language Generation 



Natural language generation is the process of producing natural language sen- 
tences using specified communicative goals [f5]. This area of study investigates 
the way computer programs can produce high-quality natural language text from 
computer-internal representations of information [12]. The natural language gen- 
eration process consists of three main activities [f8]: 

f. The information that should be communicated to the user and the way 
this information should be structured must be determined. These, usually 
simultaneous, tasks are called as content determination and text planning^ 
respectively. 

2. The split of information among individual sentences and paragraphs must 
be determined (sentence planning). During this process, in order to make a 
smooth text flow, cohesion devices (such as pronouns) to be added, should 
be dictated. 

3. The individual sentences should be generated in a grammatically correct 
manner (realization). 

In most natural language generation systems there are two different parts [2, 25]: 

I. the strategic generator^ which implements the first two of the activities above, 
and 
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2. the tactical generator^ which implements the last one of the activities above. 

In the remaining of this chapter, we present an overview of strategic generation 
and tactical generation, followed by a description of the scope of our work. 

2.1 Strategic Generation 

As indicated above, the first two activities in natural language generation, that 
of identifying the goals the utterance is to achieve and planning the way these 
goals may be achieved, is called as strategic generation [15]. For example, in 
order to describe the event in the picture in Figure 2.1, at least these five Turkish 
sentences can be generated: 

a) Ali kitabi Ahmet ' e verdi . 

b) Kitap Ahmet ' e verildi. 

c) Kitap Ahmet ' e Ali tarafmdan verildi. 

d) Ahmet ' e kitap verildi. 

e) Ali Ahmet ' e kitabi verdi. 

A strategic generator determines which (words or concepts and) surface form you 
would use to describe this event, taking into account information that is mostly 
not linguistic, such as: 

• world knowledge, 

• previous context in discourse, 

• speaker intentions, etc. 







Ahmet 


Ali 




\ 












j 


/\ 



Figure 2.1: A picture for demonstrating the event of Ali's giving the book to 
Ahmet. 

2.2 Tactical Generation 

The tactical generator, realizes, as linear text, the contents of a sentence which 
are specified usually using some kind of a feature structure. This feature structure 
can be generated by a higher level process such as a strategic generator or transfer 
process in machine translation applications, as demonstrated in Figure 2.2. In 
this process a generation grammar and a generation lexicon are used. 

A natural language grammar is a formal device for defining the relation be- 
tween natural language utterances and the computer-internal representations to 
express their meaning [26]. The same grammar (a reversible grammar) can be 
used for both analysis and generation. But, problems of parsing and genera- 
tion are rather different. In parsing, ambiguity at all levels (lexical, syntactic, 
semantic) is a very serious problem. Whereas, in generation, the problem is non- 
determinism, production of more than one sentence from a computer-internal 
representation. A generation grammar must also be augmented with style-related 
information. However, this information can be ignored in an analysis grammar, 
if only the semantics of the sentence is needed [5]. 
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Machine 
Translation 



Transfer 



Turkish 
Grammar 



DB Interface 

Text Planner 
(Strategic Generator) 



Case-frame (Featui'e Structure 

representing contents) 



TACTICAL 
GENERATOR 



Turkish 
Lexicon 



Turkish Sentence 

Figure 2.2: The usage of a tactical generator 

2.3 Scope of Our Work 

Our main goal in this thesis is to develop a tactical generator for Turkish that we 
can use in a prototype machine translation system from English to Turkish. Our 
tactical generator gets a feature structure as input from the transfer component 
in this machine translation system, representing the contents of the sentence 
to be generated, where all lexical choices have been made. The feature struc- 
tures for these sentences are represented using a case-frame representation that 
will be detailed later. This information is then converted into a linear sequence 
of lexical feature structures. Then, in order to perform morphological realiza- 
tion, this output of the tactical generator is sent to an external morphological 
generation component which performs concrete morpheme selection and handles 
morphographemic phenomena such as vowel harmony, and vowel and consonant 
ellipsis and then produces an agglutinative surface form. As Turkish morphology 
is outside the scope of this work, we refer the reader to relevant work [17]. The 
interface of our tactical generator with other components is shown in Figure 2.3. 
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Figure 2.3: The interface of our tactical generator 



Chapter 3 



Turkish 



Turkish is a free constituent order language, in which the order of the constituents 
may change according to the information to be conveyed. In the absence of any 
information regarding the information structure of a sentence (i.e., topic, focus, 
background, etc.), the constituents of the sentence obey a default order, but 
otherwise the order is almost freely changeable, depending on the constraints of 
the text flow or discourse. In the next section, we present the components of 
the information structure which controls the constituent order variations and an 
overview of Turkish sentences, noun phrases, and sentential clauses, relevant to 
subsequent chapters. 



3.1 Information Structure 



In terms of word order, Turkish can be characterized as a subject-object-verb 
(SOV) language in which constituents at certain phrase levels can change order 
rather freely, depending on the constraints of text flow or discourse. This is due 
to the fact that the morphology of Turkish enables morphological markings on 
the constituents to signal their grammatical roles without relying on their order. 
For example, the word 'kitap' (book) case marked accusative is a definite direct 
object, the word 'ev' (house) case marked dative expresses a goal and the word 
'okul' (school) case marked ablative expresses a source: 
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id 



kitab+1 



(Definite dir. object - tlieme) 



book+ACC 



ev+e 



(Dative object - goal) 



house+DAT 



okul+dan 



(Ablative object - source) 



school+ABL 

This, however, does not mean that word order is immaterial. Sentences with 
different word orders reffect different pragmatic conditions, in that, topic, focus 
and background information conveyed by such sentences differ.^ Information 
conveyed through intonation, stress and/or clefting in fixed word order languages 
such as English, is expressed in Turkish by changing the order of the constituents. 
Obviously, there are certain constraints on constituent order, especially, inside 
noun and post-positional phrases. There are also certain constraints at sentence 
level when explicit case marking is not used (e.g., with indefinite direct objects). 

Information structure indicates how linguistically conveyed information is to 
be added to a context (hearer's information state) [24]. In free word order lan- 
guages, the information structure (topic, focus, and background) is indicated 
by the word order [8]. In Turkish, the information which links the sentence to 
the previous context, the topic, is in the first position [4]. For example, in the 
sentence (b) below, the direct object, which is a pronoun, is the topic of that 



Ay§e home+LOC very get-bored+PAST+3SG 

'Ay§e got very bored at home. ' 

^See Erguvanli [4] for a discussion of the function of word order in Turkish grammar. 

^In the glosses, 3SG denotes third person singular verbal agreement, PIPL and P3SG denote 
first person plural and third person singular possessive agreement, ¥ITH denotes a derivational 
marker making adjectives from nouns, LOG, ABL, DAT, GEI denote locative, ablative, dative, 
and genitive case markers, PAST denotes past tense, and IIP denotes a marker that derives an 
infinitive form from a verb. 



sentence: 



.2 




cok sikildi. 
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b. Onu da sinemaya cagirabilir miyim? 

She+ACC too cinema+DAT call+ABILITY+QUES+lSG 

'Can I call her to the cinema too?' 

The information which is new or emphasized, the /ocws, is in the immediately 
preverbal position [4]. For example, in the answer to the following question, the 
subject, "her mother", is the focus: 

(2) Q: Bu topu Ayse'ye kim aldi? 

this ball+ACC Ay§e+DAT who buy+PAST+3SG 

'Who bought this ball to Ay§e?' 

A: Bu topu Ayse'ye annesi aldi. 

this ball+ACC Ay§e+DAT mother+P3SG buy+PAST+3SG 

'Her mother bought this ball for Ay§e. ' 

The additional information which may be given to help the hearer understand 
the sentence, the background^ is in the post verbal position [4]. For example, in 
the second sentence below, the subject, "Ayse", which is also the subject and the 
topic of the first sentence, is the background. 

(3) a. Ayse biitiin kitaplarmi eve gotiirmek 

Ay§e all book+PLU+P3SG+ACC home+DAT bring+INF 

'Ay§e wanted to bring all her books to home. ' 

istedi. 

want+PAST+3SG 

b. Fakat, tarih kitabmi okulda unuttu Ayse. 

but history book+P3SG+ACC school+LOC f orget+PAST+3SG Ay§e 

'But she, Ay§e, forgot her history book at school. ' 

Thus the topic, focus and background information, when available, alter the order 
of constituents of Turkish sentences. 



3.2 Simple Sentences 

Turkish sentences can be grouped into three: 

1. predicative sentences, 

2. existential sentences, 

3. attributive sentences. 

Predicative sentences have predicative verbs inflected in the verb paradigm. 
The following are some example predicative sentences: 

(4) a. Kitaplarimi smifta unuttum. 

book+PLU+PlSG+ACC classroom+LOC f orget+PAST+lSG 

'I forgot my books in the classroom. ' 

b. (^ocuklari okula anneleri getirdi. 

child+PLU+ACC school+DAT mother+P3PL bring+PAST+3SG 

'Their mother brought the children to the school. ' 

Existential sentences have verbs denoting existence ('var' in Turkish) or nonex- 
istence ('yok' in Turkish)."^ The following are some example existential sentences: 

(5) a. Benim iki kalemim var. 

I+GEN two pencil+PlSG existent 

'I have two pencils. ' 

a. Odasmda perde bile yoktu. 

room+P3SG+L0C curtain even non-existent +PAST 

'There were even no curtains in her room. ' 

Attributive sentences have nominal verbs which express some property of the 
subject noun phrase. The following are some example attributive sentences: 

^These correspond to 'There is/are sentences in English. 
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(6) a. Bu cay cok sicak. 
this tea very hot 

'This tea is very hot. ' 

b. Kitaplar masamm lizerinde. 
book+PLU table+PlSG+GEN on+P3SG+L0C 

'The books are on my table. ' 

In the following sections, we present additional information about these three 
kinds of sentences including their constituents, default word order, etc. 

3.2.1 Predicative Sentences 

Predicative sentences are sentences whose verbs are inflected in the verb paradigm. 
Tj/p«ca/ constituents of such sentences are: subject, expression of time, expression 
of place, direct object, beneficiary, source, goal, location, instrument, value des- 
ignator, path, duration, expression of manner and verb (the verb is obligatory). 

• The subject is the syntactic subject. 

• Expression of time and expression of place are adjuncts. 

• Direct object is the syntactic direct object of the sentence. 

• Beneficiary is the person who is benefiting from an action or a state. 

• Source indicates the point of origin of a displacement, whereas 50a/ indicates 
the destination of a displacement. 

• Location denotes the spatial position of the predicate. 

• Instrument is the medium which the predicate is done with. 

• Value designator is the money which the action is taken for. 

• Path, duration^ and expression of manner are adjuncts. 



In the absence of any control information, such as the information structure 
components topic, focus, or background, (discussed earher, indicating discourse 
constraints) the constituents of Turkish sentences have the following default order: 

subject, expression of time, expression of place, direct object, benefi- 
ciary, source, goal, location, instrument, value designator, path, dura- 
tion, expression of manner, verb. 

All of these constituents except the verb are optional unless the verb obligatorily 
subcategorizes for a specific lexical item as an object in order to convey a certain 
(usually idiomatic) sense. For example, in the following sentence the direct ob- 
ject, 'kafa' ('head' in English) in accusative case, is obligatory for the idiomatic 
usage: 

(7) a. Adam kafayi yedi. 

man head+ACC eat+PAST+3SG 

'The man got mentally deranged. ' 

The definiteness of the direct object adds a minor twist to the default order. 
If the direct object is an indefinite noun phrase, then it has to be immediately 
preverbal. This is due to the fact that, both the subject and the indefinite 
direct object have no surface case-marking that distinguishes them, so word order 
constraints come into play to force this distinction. 

In order to present the fiavor of word order variations in Turkish, we provide 
the following examples. These two sentences are used to describe the same event 
(i.e., have the same logical form), but they are used in different discourse situa- 
tions. The first sentence presents constituents in a neutral default order, while 
in the second sentence, the time adjunct, 'bugiin' (today), is the topic and the 
subject, 'Ahmet', is the focus: 

(8) a. Ahmet bugiin evden okula otobiisle 3 dakikada 

Ahmet today home+ABL school+DAT bus+WITH 3 minute+LOC 

'Ahmet went from home to school by bus 
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gitti. 

go+PAST+3SG 

in 3 minutes today. ' 

b. Bugiin evden okula otobiisle 3 dakikada Ahmet 

today home+ABL school+DAT bus+WITH 3 minute+LOC Ahmet 

'It was Ahmet who went from home to school in 3 minutes 

gitti. 

go+PAST+3SG 

hy bus today. ' 

Although, sentences (b) and (c), in the following example, are both grammatical, 
(c) is not acceptable as a response to the question (a): 

(9) a. Ali nereye gitti? 

Ali where+DAT go+PAST+3SG 

'Where did Ali go?' 

b. Ali okula gitti. 

Ali school+DAT go+PAST+3SG 

'Ali went to school. ' 

c. * Okula Ah gitti. 

school+DAT Ali go+PAST+3SG 

'It was Ali who went to school. ' 

The word order variations exemplified by (9) are very common in Turkish, espe- 
cially in discourse. 

3.2.2 Existential Sentences 

Existential sentences are sentences which have a verb denoting existence ( root 
word is 'var') or nonexistence (root word is 'yok'). Typical constituents of ex- 
istential sentences are: poss-subj (possessor of subject), (expression of) time. 



(expression of) place, subject and verb. Poss-subj is separated from the sub- 
ject noun phrase, because Turkish allows for the intervention of time and place 
adjuncts between the possessor and the remaining of the subject noun phrase. 
The possessor can also move to any position of the sentence independently of the 
subject. For example, both of the following Turkish sentences are grammatical, 
and they have the same logical form: 

(10) a. Benim evde iki kitabim var. 

I+GEN home+LOC two book+PlSG existent 

'I have two books at home. ' 

b. Evde iki kitabim var benim. 
home+LOC two book+PlSG existent I+GEN 

'At home, I have two books. ' 

In the first sentence, the possessor of the subject noun phrase is at the sentence- 
initial (topic) position, whereas in the second sentence, the possessor of the sub- 
ject noun phrase is at the post verbal (background) position. 

In the absence of any control information, the constituents of existential sen- 
tences have the following default order: 

poss-subj, time, place, subject, verb 

The verb is again obligatory like in predicative sentences. Some other constituents 
may not intervene between the subject and the verb. The location for the con- 
stituent which is the focus, is the position immediately preceding the subject. 
The locations for topic and background are again the sentence initial position 
and post verbal position, respectively. 

3.2.3 Attributive Sentences 

Attributive sentences are used in order to express some property of an entity 
(the subject of the sentence). This may be the location, quality, quantity, owner. 
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order, etc. of the subject. The constituents of such sentences are: subject, pred- 
property, (expression of) time, and (expression of) place. The pred-property is 
the constituent conveying a property of the subject noun phrase. 

In the absence of any control information, the constituents of existential sen- 
tences have the following default order: 

subject, time, place, pred-property 

The pred-property can be a specifier, a modifier, or a noun phrase (which can be 
in any case, except the accusative case). For example, in the following sentences, 
the pred-property is a specifying-relation, a possessor, a qualitative modifier, a 
quantitative modifier, an ordinal, a noun phrase in the nominative case, and a 
noun phrase in the ablative case, respectively: 

(11) a. Kalemim masada. 

pencil+PlSG table+LOC 

'My pencil is on the table. ' 

(12) a. Bu kalem benim. 

this pencil I+GEN 

'This pencil is mine. ' 

(13) a. Bu kalem kirmizi. 

this pencil red 

'This pencil is red. ' 

(14) a. Kalemlerin sayisi iki. 

pencil+PLU+GEN number+P3SG two 

^ The number of pencils is two.' 



(15) a. Bu cocuk smifta ikincidir. 

this child classroom+LOC second+COPULA 

'This child has a rank of two in the classroom. 
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(16) a. Bu bir kopektir. 

this a dog+COPULA 
'This is a dog. ' 

(17) a. Gelisimiz Ankara'dan. 

come+PART+PlPL Ankara+ABL 

'Our coming is from Ankara. ' 

3.3 Complex Sentences 

Complex sentences are combinations of simple sentences (or complex sentences 
themselves) which are linked by either conjoining or various relationships like 
conditional dependence, cause-result, etc. An example complex sentence formed 
by the conjunction of two simple sentences is: 

(18) Kapiyi actim ve odaya girdim. 
door+ACC open+PAST+lSG and room+DAT enter+PAST+lSG 

'I opened the door and entered the room' 

The following sentences are also complex sentences formed by two simple sen- 
tences which are combined by conditional dependence (the first one) and cause- 
result relationship (the next two), respectively: 

(19) Kitabi okursan sorunun cevabmi 
book+ACC read+A0R+C0ND+3SG question+GEN answer+P3SG+ACC 

'If you read the book, you will find the answer to your question. 

bulacaksm. 
find+FUT+2SG 

(20) Sen geldigin icin o gitti. 

you come+PART+P2SG because he go+PAST+3SG 

'He went because you came. ' 



(21) 



Gelmesini 



istemedigimden 



onu 



come+INF+P3SG+ACC want+NEG+PART+PlSG+ABL he+ACC 



'Since I did not want him to come, I did not call him. ' 



cagirmadim. 



call+NEG+PAST+lSG 



3.4 



Noun Phrases in Turkish 



Noun phrases are one of the fundamental components of natural language sen- 
tences. They are used to denote entities and events in the real world. They 
function in many roles, such as the subject or the object in a sentence or a 
sentential clause. 

A noun phrase may consist of only one word, which can either be noun (or a 
simple modifier), or a pronoun, or it may consist of more than one word. The 
distinguished component of a noun phrase is called the head of the noun phrase. 
It can be specified, modified and/or classified by other constituents, referred to 
as specifiers, modifiers and classifiers, respectively. 

The constituents of a Turkish noun phrase have an almost fixed order: 

1. Set specifier, 

2. Possessor, 

3. Specifying relation, 

4. Demonstrative specifier, 

5. Quantifier, 

6. Modifying relation, 

7. Ordinal, 

8. Quantitative modifier. 



9. Qualitative modifier (zero or more), 

10. Classifier, and 

11. Head. 

The first five of the above constituents are specifiers, and the next four are modi- 
fiers. All of these constituents are optional. In the following sections, we describe 
these constituents in detail. 

As can be seen from this order, speficiers almost always precede modifiers and 
modifiers almost always precede classifiers, which precede the head noun, al- 
though there are numerous exceptions. Also, within each group, word order 
variation is possible due to a number of reasons: 

• The order of quantitative and qualitative modifiers may change: the aspect 
that is emphasized is closer to the head noun. For example, to denote "two 
red pencils", both of the following two noun phrases can be used in Turkish: 

(22) a. iki kirmizi kalem 
two red pencil 

'two red pencils ' 

b. kirmizi iki kalem 
red two pencil 

'two red pencils ' 

The indefinite singular determiner may also follow any qualitative modifier 
and immediately precede any classifier and/or head noun. 

• Depending on the quantifier used, the position of the demonstrative spec- 
ifier may be different. For example, in the first noun phrase below, the 
demonstrative specifier precedes the quantifier, whereas in the second one 
the quantifier precedes the demonstrative specifier: 

■*A classifier in Turltish is a nominal modifier which forms a noun-noun noun phrase, essen- 
tially the equivalent of book in forms like book cover in English. We use the term modifier for 
adjectival modifiers and not for nominal ones. 
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(23) a. bu birkac kalem 

this several pencil 

'these several pencils' 

b. biitiin o kagitlar 
all that paper+PLU 

'all those papers' 

This is a strictly lexical issue and not explicitly controlled by the feature 
structure, but by the information (stored in the lexicon) about the deter- 
miner used. 

• The order of lexical and phrasal modifiers (e.g., corresponding to a postposi- 
tional phrase on the surface) may change, if positioning the lexical modifier 
before the phrasal one causes unnecessary ambiguity (i.e., the lexical modi- 
fier in that case can also be interpreted as a modifier of some internal con- 
stituent of the phrasal modifier). For example, the first noun phrase below 
has two interpretations. However, there is no such problem in the second 
one: 

(24) a. iki kalemli adam 

two pencil+WITH man 

'two men with a pencil' 

'a man with two pencils' 

b. kalemli iki adam 

pencil+WITH two man 

'two men with a pencil' 

So, phrasal modifiers always precede lexical modifiers and phrasal specifiers 
precede lexical specifiers, unless otherwise is specified, in which case punc- 
tuation needs to be used. 

• A modifier may come after the classifier. For example in the first noun 
phrase below an ordinal modifier intervened between the classifier and the 
head, and in the second one, the intervening constituent is a qualitative 
modifier: 



(25) a. futbol birinci ligi 

soccer first league+P3SG 

'first soccer league ' 

b. Tiirkiye milli parklari 

Turkey national park+PLU+P3SG 

'national parks of Turkey' 

• The possessor may scramble to a position past the head or even outside the 
phrase (to a background position), or allow some adverbial adjunct inter- 
vene between it and the rest of the noun phrase, causing a discontinuous 
constituent. For example, the possessor of the subject in the following sen- 
tence has moved to a background position: 

(26) a. Kedisini gordiin mil Ayse'nin? 

cat+P3SG+ACC see+P2SG+QUES Ay§e+GEN 

'Did you see Ay§e's cat?' 

Although we have included control information for scrambling the possessor 
to post head position, we have opted not to deal with either discontinuous 
constituents or long(er) distance scramblings as these are mainly used in 
spoken discourse. 

• Furthermore, since the possessor information is explicitly marked on the 
head noun, sometimes the discourse does not require an overt possessor. 
For example, if the owner of the pencil is not to be emphasized, both of the 
following noun phrases can be used to denote "your pencil": 

(27) a. senin kalemin 

you+GEN pencil+P2SG 

'your pencil' 

b. kalemin 

pencil+P2SG 

'your pencil' 

In the following subsections, we present the constituents of a noun phrase in 
depth. 
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3.4.1 Specifiers 

Specifiers are constituents of noun plirases wliicli are used to distinguisli tlie liead 
noun out of a set of possible similar nouns in the context. The specifiers of a noun 
phrase are: the quantifier, the demonstrative specifier, the specifying relation, the 
possessor, and the set specifiers. 

The Quantifier 

Quantifiers are used to pick out the quantity of items denoted by the head noun. 
The Turkish quantifiers are: 'her' (every), 'biraz' (a little), 'bazi' (some), 'birkac' 
( a few), 'bircok' (many), 'biitiin' (all), 'trim' (all), 'kimi' (some) and the indefinite 
article 'bir' (a/an). 

In Turkish, some quantifiers can only specify heads that are morphologically 
marked plural, and some can only specify heads that are singular. For example: 

(28) a. her insan 

every human 

'every human' 

b. * her insanlar 

every human+PLU 

(29) a. bazi masalar 

some table+PLU 

'some tables' 

b. * bazi masa 
some table 

The countability of the head also plays an important role in the selection of 
the quantifier. Some quantifiers can only specify countable heads, whereas some 
others can only specify uncountable heads. 



In Turkish, sometimes a demonstrative specifier and a quantifier can specify 
tlie same liead. Tliis is a property of tlie quantifier. For example: 

(30) a. su birlcac ogrenci 

that several student 

'those several students' 

b. * bu bazi Icitaplar 
this some book+PLU 

The order of the demonstrative specifier and the quantifier may also change, 
depending on the quantifier used. For example: 

(31) a. biitiin bu Icalemler 

all this pencil+PLU 

'all these pencils' 

b. * birlcac su ogrenci 
some that student 

All of the above properties of quantifiers are coded in the lexicon. 

The indefinite article 'bir' (a/an) can also be considered as a quantifier. With 
'bir', the word order can change and it may occur between the qualitative mod- 
ifiers and the classifier. The surface form of the indefinite article, 'bir', is also 
the surface form of the cardinal 'one'. If the word 'bir' is preceding a qualitative 
modifier in the surface form, it can either be an indefinite article or a cardinal, 
but if it is succeeding a qualitative modifier, then it is an indefinite article. 

The presence of the indefinite article depends on the definiteness, specificity, 
and referentiality of the head noun. These are explained in detail in the following 
sections. 



The Demonstrative Specifier 

Demonstrative specifiers are used to point out items [16, p. 145]. Turkish 
demonstrative specifiers are: 'bu' (this), 'su' (that), and 'o' (that). The demon- 
strative specifiers distinguish between the degrees of proximity to the spealcer. 
'bu' is used to point out items that are near the spealcer. 'su' and 'o' are used to 
point out items which are not near the spealcer, but the items pointed out by 'su' 
are cioser (and possibiy visibie) to the spealcer than the ones pointed out by 'o'. 

The Specifying Relation 

Specifying relations are used to pick; out items by giving their reiationship with 
other items. In Turkish, a specifying relation is a postpositional phrase formed 
by a noun phrase and one of the postpositions 'ait' (belonging to), 'dair' (about), 
etc. or an adjectival phrase, formed by the -|-ki relativizer from a singular noun 
phrase (with temporal or spatial location semantics) in the nominative case, or 
a noun phrase in the locative case. 

If the specifying relation is a postpositional phrase, then the postposition gives 
the relationship of its argument noun phrase, with the head. For example, in the 
following noun phrase, the specifying relation is a postpositional phrase and the 
postposition gives a relation of ownership: 

(32) a. Ali'ye ait kitap 

Ali+DAT belonging-to book 

'The book that belongs to AW 

If the postpositional phrase mentions a spatial location, it gets the +ki relativiza- 
tion suffix. For example: 

(33) a. hastaneden onceki ev 

hospital+ABL bef ore+REL house 

'house before the hospital' 

The specifying relation which is an adjectival phrase, mentions a spatial or 
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temporal location. The following are the examples of noun phrases having an 
adjectival phrase specifying relation: 

(34) a. Ali'nin evdeki kitabi 

Ali+GEN home+LOC+REL book+P3SG 

'Ali's book at home' 

(35) a. diinkii smav 

yesterday+REL examination 

'the exam yesterday' 



The Possessor 



Possessor distinguishes an item by expressing its owner. ^ In Turkish, the posses- 
sor is a noun phrase with a genitive case marker. The agreement of the possessor 
should be the same as the possessive marker of the head noun, when it is present.^ 
The information expressed by the possessor, can also be expressed by the posses- 
sive marker of the head alone, if such emphasis is not necessary in the context. 
The following two noun phrases, then, have almost the same semantics, but in 
the first one it is emphasized that 'the book is mine, as opposed to somebody 
else's', while the latter is neutral: 

(37) a. benim kitabim 

my book+PlSG 

'my book' 

b. kitabim 
book+PlSG 

'my book' 

®This ownership includes any kind of possession. For example, in the noun phrase "John's 
book", John may also be the writer of the book. 

®In general, the head noun need not have a possessive in Turkish, if there is a possessor. For 
example, the following is also a grammatical noun phrase: 

(36) a. benim kitap 
my book 

'my book' 

But, such forms are used very infrequently, and should probably not be dealt with at this point. 



Therefore, (37b) cannot be used in the answer to (38a) in the following discourse, 
where the owner of the book should be emphasized: 

(38) a. Kimin kitabi kalm? 

whose book+P3SG thick 

'Whose book is thick?' 

b. Benim kitabim kalm. 
I+GEN book+PlSG thick 

'My book is thick. ' 

c. * Kitabim kalm. 

book+PlSG thick 

The possessor is the only constituent in a noun phrase, which can move from 
its position to a position past the head. For example, both of the following sen- 
tences mention that 'my house is beautiful', but they convey different information, 
regarding focus, topic and background: 

(39) a. Benim evim giizeldir. 

I+GEN house+PlSG beautiful+COPULA 

'My house is beautiful. ' 

b. Evim giizeldir benim. 

house+PlSG beautiful+COPULA I+GEN 

'My house is beautiful. ' 

The possessor of the subject is backgrounded in the latter. 
The Set Specifier 

If a noun phrase is specified by a set specifier, then the noun phrase denotes 
the members of some set, which have some distinguishing identity or property, or 
which are of some quantity. The set specifier is a noun phrase which is semanti- 
cally plural and is in the ablative case. For example: 
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(40) a. akrabalardan amcamm kizi 

relative+PLU+ABL uncle+PlSG+GEN daughter+P3SG 

'my uncle's daughter among relatives' 

If the head, which is specified by a set specifier, is the same as that of the set 
specifier, then it drops. In such cases, one of the modifiers of the head act as the 
head. For example: 

(41) a. adamlardan bu ikisi 

man+PLU+ABL this two+P3SG 

'these two among the men' 

b. kazaklardan evdeki ikisi 
pullover+PLU+ABL home+LOC+REL two+P3SG 

'among the pullovers, the two at home' 

c. kitaplardan en kahni 
book+PLU+ABL most thick+P3SG 

'the thickest one among the books' 

3.4.2 Modifiers 

Modifiers are constituents of noun phrases which give information about the 
properties of the concept denoted by the head noun, or about the relations of its 
properties with properties of other concepts. The modifiers of a noun phrase are: 
the modifying relation, the ordinal, the quantitative modifier, and the qualitative 
modifier. 

The Modifying Relation 

Modifying relation gives information about the properties of a concept. This 
property can also be given in comparison with another concept. In Turkish, 
a modifying relation can either be a postpositional phrase, like the specifying 
relation, or a noun phrase followed by one of the suffixes +DAn (ablative, made 
of), +1H (with), +sHz (without), +DA (locative, made on), or +lHk (of). 



The postpositional phrase gives the property in comparison with another item. 
It is formed by a noun phrase and one of the postpositions 'gibi' (hke), 'kadar' 
(as much as), 'once' (before), etc. For example: 

(42) a. at gibi kopek 

horse like dog 

'dog like a horse ' 

The case of the noun phrase should match the subcategorization requirement 
of the postposition used. If the head of the noun phrase the whole noun phrase 
is compared to is the same as the head of the whole noun phrase, then it may be 
eliminated from the surface form. For example: 

(43) a. ? evdeki elmalar kadar elma 

home+LOC+REL apple+PLU as many as apple 

'as many apples as there are at home ' 

b. evdeki kadar elma 

home+LOC+REL as many as apple 

'as many apples as there are at home ' 

The modifying relation can also be a noun phrase followed by one of the 
suffixes: 

1. +DAn (ablative case marking indicating "made of" relationship), 

(44) tahtadan masa 
wood+ABL table 

'table made of wood' 

2. +1H (adjective derivation suffix indicating "with"), 

(45) ortiilii masa 
cover+WITH table 

'table with cover' 

3. +sHz (adjective derivation suffix indicating "without"). 
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(46) ortiisiiz masa 
cover+WITHOUT table 

'table without cover' 

4. +DA (locative marker indicating relationship "made-on"), 

(47) kiremitte sis 
brick+LOC kebap 

'kebap (cooked) on brick' 

5. or +lHk (adjective derivation suffix indicating "has measurable property" 
relationship), 

(48) iki kiloluk karpuz 

two kilo+OF watermelon 

'a watermelon weighing two kilos' 

If the suffix is +DAn, then the head of the noun phrase must have the semantic 
property of denoting a material, though this may be relaxed. 



The Ordinal 



An ordinal expresses the order of an item. In Turkish, an ordinal modifier can be 
one of: ilk (first), birinci (first), ikinci (second), liciincii (third), etc., and sonuncu 
(last). Although the words 'ilk' and 'birinci' seem to have the same semantics, 
they have a slightly different behaviour. 

In Turkish, there can be an intensifier which modifies the ordinal. This is the 
adverb 'en' (most). For example: 

(49) en sonuncu masa 
most last table 

'the (most) last table' 

But this adverb can only intensify the ordinals 'birinci' and 'sonuncu'.'' 
^Note that 'birinci' can take an adverbial intensifier, whereas 'ilk' cannot. 



The Quantitative Modifier 

A quantitative modifier expresses the quantity of the referent denoted by the 
head noun. It may be, one of: 

1. a cardinah 

(50) evdeki iki masa 
home+LOC+REL two table 

'two tables at home' 

2. a range: 

(51) bu lie bes kalem 
this three five pencil 

'these 3 to 5 pencils' 

3. an adjective that expresses a fuzzy quantity (for example: az or cok): 

(52) cok elma 
lot apple 

'lots of apples ' 

Note that this adjective can also be followed by one of the words 'miktarda' 
or 'sayida', if the head is uncountable and countable, respectively. This 
usage is more formal. 

4. a noun phrase, where the head is a container or a measure noun modified by 
a quantitative modifier of type cardinal or range. If the head is a container, 
then the noun phrase specifying the quantity may also be followed by the 
word 'dolusu', meaning 'full of. For example:® 

(53) a. lie bardak su 

three glass water 

'three glasses of water' 

b. lie bardak dolusu su 
three glass full of water 

'three glassfuls of water' 

^Note that (48) and (53c) are different. 



c. iki kilo karpuz 

two kilo watermelon 

'two kilos of watermelon' 

In Turkish, a head cannot be modified by a quantitative modifier and specified 
by a quantifier at the same time, though some quantifiers allow for the presence 
of a cardinal. For example: 

(54) a. her iki kalem 

every two pencil 

'both pencils' 

b. * bazi iki masalar 

some two table+PLU 

The cardinal may specify the quantity as an upper or a lower limit. For 
example: 

(55) a. en az iki elma 

most little two apple 

'at least two apples' 

b. en cok iki kalem 
most much two pencil 

'at most two pencils' 

The limit may also be given as a range as well as a cardinal. For example: 

(56) a. en az iki tie kisi 

most less two three person 

'at least two to three people ' 

The cardinal which specifies a quantity may be followed by one of the words 
'adet', 'tane' and 'parca' (all meaning 'piece'), but if the quantitative modifier 
is also a noun phrase and its head is a measure noun, then these words cannot 



be present between the cardinal and the measure noun in the surface form. For 
example: 

(57) a. iki elma 

two apple 

'two apples' 

b. iki adet elma 
two piece apples 

'two apples' 

c. * iki adet kilo elma 

two piece kilo apple 

If the head is modified by a quantitative modifier which is a cardinal greater 
than one, or a quantitative modifier which is a range, then this noun phrase is 
semantically plural. But, in Turkish, such a noun phrase is not morphologically 
marked plural at the same time. 

The Qualitative Modifiers 

A qualitative modifier expresses qualitative properties of a concept. There 
may be any number of qualitative modifiers modifying the head. In Turkish, a 
qualitative modifier is an adjective phrase, which is formed from ordinary adjec- 
tives. For example: 

(58) biiyiik sari kapi 
big yellow door 

'big yellow door' 

An adjective in the adjective phrase can also be modified by an adverb or a 
postpositional phrase which function as intensifiers. In the noun phrase in (59a), 
the qualitative adjective 'biiyiik' (big) is modified by an adverb, whereas in the 
noun phrase in (59b), it is modified by a postpositional phrase [23, p. 160]: 
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(59) a. en biiyiik masa 

most big table 

'the biggest table ' 

b. at kadar biiyiik bir kopek 
horse as big a dog 

'a dog as big as a horse ' 

The order of quantitative and qualitative modifiers modifying the head is not 
fixed. The one that is emphasized is closer to the head in the surface form. For 
example, in the first phrase below, the fact that 'the tables are big' is emphasized, 
while in the second one, the fact that 'there are two tables' is emphasized: 

(60) a. iki biiyiik masa 

two big table 

'two big tables' 

b. biiyiik iki masa 
big two table 

'two big tables' 

3.4.3 Classifiers and the Head 

The head of a noun phrase is either a proper noun, common noun or a pronoun. 
Pronouns cannot take any specifiers, modifiers, or classifiers. Nouns can further 
be classified by a set of classifiers. A classifier can also be a common noun, 
or it can itself be a noun phrase which can only have a classifier, classifying the 
head and/or modifiers modifying the head. Furthermore, the head of a noun 
phrase can also be a noun phrase having only a classifier classifying the head. 
For example, in the following noun phrase, the head and the classifier are both 
noun phrases, each having a classifier: 

(61) kredi karti komisyon orani 
credit card+P3SG commission rate+P3SG 

'credit card commission rate ' 




classifier liead 
NP NP 



classifier head classifier head 
kredi karti komisyon orani 

Figure 3.1: The structure of the noun phrase in (61) 

The tree in Figure 3.1 is given to demonstrate the structure of this noun phrase. 

A modifier may intervene between the classifier and the head. The following 
noun phrase is an example to such a form:^ 

(62) a. Disisleri eski Bakani 

foreign affairs old minister+P3SG 

'old Minister of Foreign Affairs' 

The head of the noun phrase can sometimes drop. This is usually the case, 
when the head is already introduced into the discourse, and its other distinguish- 
ing properties also need to be introduced. In such a case, the classifier of the noun 
phrase, if present, drops, too. A modifier or a specifier (only the possessor, the 
quantifier, or the specifying relation) of the noun phrase, which is immediately 
preceding the head in the normal surface form substitutes for the head and gets 
the case, number and possessive markings of the head. If this element is posses- 
sor, then it gets the suffix +ki, and then only the number and case markings. For 
example: 



(63) a. Kimin kitabi kalm? 

whose book+P3SG thick 

'Whose book is thick?' 



^See also earlier examples. 



b. Ayse'nin kitabi kalm. 
Ay§e+GEN book+P3SG thick 

'Ay§e's book is thick.' 

c. Ayse'ninki kahn. 
Ay§e+GEN+REL thick 

'That of Ay§e is thick' 

If the element which is substituting for the head, is a modifying element other 
than the possessor, it gets both of the number, case and possessive markings of 
the head. For example: 

(64) a. Hangi kalemi istiyorsun? 

which pencil+ACC want+PRG+2SG 

'Which pencil do you want?' 

b. Kirmiziyi istiyorum. 
red+ACC want+PRG+lSG 

'I want the red (one).' 

3.4.4 Definiteness, Specificity, and Referent iality 

Three main distinctions that underlie the interpretation of a noun phrase are 
definiteness, specificity, and referentiality. These are some of the factors which 
contribute in the determination of the case of the noun phrase and in the deter- 
mination of the presence of the indefinite article. 

• If it is possible for the hearer to build an unambiguous link between a noun 
phrase, and an entity or a group of entities, then the noun phrase is definite. 
The underlined noun phrase in the following sentence is an example to a 
definite noun phrase: 

(65) Mary dropped the book on the table. 

The underlined noun phrases in the following sentences are examples of 
definite and indefinite noun phrases in Turkish, respectively: 
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(66) a. Ahmet tarih kitabim okuyor. 

Ahmet history book+P3SG+ACC read+PRG+3SG 

'Ahmet is reading his history book. ' 

b. Ali'ye bir kitap verdim. 

Ali+DAT a book give+PAST+lSG 

'I gave a book to Ali. ' 

• If the entity that the noun phrase is hnked to is contextually bound (i.e. 
element of the universe of discourse), then the noun phrase is specific [13]. 
The underhned noun phrase in the following sentence is an example to a 
noun phrase, which is definite and specific: 

(67) Three cats entered the kitchen. They ate the cake. 

The underlined noun phrases in the following sentences are examples of 
specific and non-specific noun phrases in Turkish, respectively: 

(68) a. Ali kitap okumayi sevmiyor. 

Ali book read+INF+ACC like+NEG+PRG+3SG 

'Ali doesn't like reading book. ' 

b. Kitabmi okulda birakti. 

book+P3SG+ACC school+LOC leave+PAST+3SG 

'He left his book at school. ' 

• A noun phrase is referential, if there is a particular object or a set of objects 
within the relevant universe of discourse, that the noun phrase refers to [13]. 
The underlined noun phrases in the following sentence are examples to noun 
phrases in English and Turkish, which are used referent i ally: 

(69) Mary went to the cinema to watch 'Blues Brothers'. 

(70) Kiitiiphaneye kitap okumak icin gittim. 
library+DAT book read+INF for go+PAST+lSG 

'I went to the library for reading a book. ' 

A noun phrase is non-referential, if there is no such object and the noun 
phrase denotes a concept without pointing out any particular individual [13]. 
The underlined noun phrases in the following sentences are examples to noun 
phrases in English and Turkish, which are used non-referentially: 
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(71) Jim went to high-school in Paris. 

(72) Ali okula alti yasmda basladi. 

Ali school+DAT six age+P3SG+L0C start+PAST+3SG 

'Ali started school when he was six years old. ' 

Definiteness, specificity, and relerentiality ol noun phrases can be signaled 
by various strategies like morphological marking, word order, stress and context. 
The marking of noun phrases in Turkish and how this correlates with definiteness. 
specificity, and relerentiality are summarized [4] in the following table:^° 





Referential 


Non-referential 




definite 


Indefinite 








Specific 


Non-specific 




Subject sg. 




bir 


bir 




pi. 


+lAr 






+IAt 


Object sg. 


_+yH 


bir +yH 


bir (+yH) 




pi. 


+lAr+yH 






-FlAr-FyH 


Oblique sg. 


_+CASE 


bir _+CASE 


bir _+CASE 


_+CASE 


pi. 


_+lAr+CASE 






_+lAr+CASE 



3.4.5 Multiple Specifiers and Modifiers 

There may be more than one modifier or specifier of one kind, modifying or 
specifying the head, or a noun phrase may be a conjunction (or a disjunction) of 
more than one noun phrases, forming a list. Examples to these can be given as 
follows: 

"'^'^-l-yH is the accusative case morpheme; the glide [y] is omitted when the accusative ending 
is attached to a word ending in a consonant, H denotes a high vowel {i,i,u,ii} resolved according 
to vowel harmony rules. 

-|- lAr is the plural morpheme (A denotes {a,e}). Parentheses indicate optionality. 



4Z 



(73) a. tahtadan, ortiisiiz masa 
wood+ABL cover+WITHOUT table 

'a table made of wood, without a cover' 

b. tahtadan biiyiik ve yesil masa 
wood+ABL big and green table 

'big and green table made of wood' 

c. ihtiyar adam ve deniz 
old man and sea 

'the old man and the sea' 

d. iki veya tie araba 
two or three car 

'two or three cars' 

If the elements of this hst are modifying relations or qualitative modifiers 
modifying a head, then there may be a comma between the elements. Instead of 
a comma, there may also be a conjunction or a disjunction between the last two 
elements. If the elements of this list are noun phrases, then there should be a 
disjunction or a conjunction between the last two elements. 



3.5 Sentential Clauses 

Sentential clauses correspond to either: 

• full sentences with non-finite or participle verb forms which act as noun 
phrases in either argument or adjunct roles, or 

• gapped sentences with participle verb forms which function as modifiers of 
noun phrases (the filler of the gap). 

The former non-gapped forms in Turkish can be further classified into those 
representing acts^ facts and adverbials. Figure 3.2 exhibits the classification of 
sentential clauses. 



Sentential Clauses 




indefinite acts definite acts 



Figure 3.2: The classification of sentential clauses 

Sentential arguments of verbs adhere to the same morphosyntactic constraints 
as the nominal arguments. For example, the participle of, say, a clause that acts 
as a direct object is case-marked accusative, just as the nominal one would be. 
The subject and the direct object of the following sentence are both sentential 
clauses, and the direct object is case marked accusative because of the reason 
above. 

(74) Ali'nin buraya gelmesi bizim isi bitirmemizi 

Ali+GEN here come+INF+P3SG we+GEN j ob+ACC f inish+INF+PlPL+ACC 

'Ali's coming here made us finish the job easier.' 

kolaylastirdi. 
make_easy+PAST+3SG 

In the following subsections, acts, facts, adverbials, and gapped sentential 
clauses will be presented in detail. 

3.5.1 Acts 

The sentential clauses representing acts are classified into two: those representing 
indefinite acts and definite acts. The verbs of the sentential clauses representing 
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an indefinite act end with the suffix +mAk (+mak or +mek). For example, the direct 
object of the following sentence, 'kitap okumak' ('to read a book' in English), is 
a sentential clause representing an indefinite act. 

(75) Kitap okumak istiyorum. 
book read+INF want+PRG+lSG 

'I want to read a book. ' 

The verbs of the sentential clauses representing a definite act take the suffix 
+mA (+ma or +me) or +H§ (+i§, +i§, +u§, or+ii§) and a possessor suffix agreeing 
with the subject of the sentential clause. The direct object of the following 
sentence, 'gelmesi' ('his coming' in English), is an example to a sentential clause 
representing a definite act: 

(76) Gelmesini istedim. 
come+INF+P3SG+ACC want+PAST+lSG 

'I wanted him to come. ' 

3.5.2 Facts 

Sentential clauses which correspond to full sentences representing facts have par- 
ticiple verb forms. Their verbs can be in one of the two forms, depending on the 
time of the event denoted by the sentential clause. If that time precedes the time 
of the event expressed by the verb of the main sentence, or is concurrent, then 
the verb of the sentential clause takes a +dHk suffix, otherwise it takes a +yAcAk 
suffix. Eurthermore, the verb of the sentential clause takes a possessor suffix, 
which agrees with the subject of the sentential clause. The direct objects of the 
following sentences are examples to such forms, respectively: 

(77) Kitap okudugumu gordii. 

book read+PART+PlSG+ACC see+PAST+3SG 

'He saw that I am reading a book. ' 
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(78) Gelecegini zannettim. 
come+PART+P3SG+ACC think+PAST+lSG 

'I thought that he will come. ' 

3.5.3 Adverbials 

The verb forms of the sentential clauses which represent adverbials depend on 
the role and semantics of the sentential clause. The suffix that the verb gets can 
be +ArAk for a manner adverbial, +ken for a manner or a time adverbial, +yIncA 
for a time adverbial, etc. The manner adjunct of the following sentence is an 
example manner adverbial, and the time adjunct of the second sentence is an 
example temporal adverbial: 

(79) a. [Kosarak] odaya girdi. 

run+ADVB room+DAT enter+PAST+3SG 

'He entered the room running. ' 

b. [Onu buraya gelirken] gordiim. 

he+ACC here+DAT come+AOR+ADVB see+PAST+lSG 

'I saw him while he was coming here. ' 

A list of the suffixes for making adverbials is given in Appendix A. 

3.5.4 Gapped Sentential Clauses 

Gapped sentential clauses correspond to gapped sentences with participle verb 
forms (relative clauses) and they function as a modifier of the noun phrase, which 
is the filler of the gap. There are two strategies of relativization: subject participle 
(the gapped constituent is the subject of the clause) and object participle (the 
gapped constituent is anything other than the subject) [f, 6]. These differ in 
morphological markings on the verb and the subject of the clause and in the 
presence of an agreement between the subject and the verb of the clause. The 
selection of morphological markings depends on the thematic role of the gap, 
voice and transitivity of the verb of the sentential clause, and the specificity of the 
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subject of the sentential clause. The suffix that the verb of the sentential clause 
gets can be: +yAn, +yAcAk, +mH§ alone, or +dHk, +yAcAk plus a possessive 
suffix. The table given in Appendix B gives the suffix in different situations and 
an example noun phrase corresponding to each form. 

In subject participles, since the subject is gapped, it is not marked genitive, 
and there is no agreement between the verb and the subject. The direct object 
of the following sentence is an example to subject participles: 

(80) [ i Odaya giren] adami^ biliyorum. 

room+DAT enter+PART man+ACC know+PRG+lSG 

'I know the marii whoi entered the room. ' 

In object participles, the subject of the clause is marked genitive and the verb 
is marked with a possessive suffix (agreeing with the subject). The direct object 
of the following sentence is an example to object participles: 

(81) [Onun i verdigi] kitabi^ okudum. 

he+GEN enter+PART+P3SG book+ACC read+PAST+lSG 

'I read the booki (thati) he gave me.' 

In Turkish, unbounded relativization, relativization in embedded clauses, is 
also possible. The direct object of the following sentence is an example to un- 
bounded dependencies: 

(82) Adam [[[ i okumak] istedigini] soyledigi] kitabi^ 

man read+INF want+PART+P3SG+ACC say+PART+P3SG book+ACC 

'The man didn't read the hook that he told he wanted to read. ' 

okumadi. 

read+NEG+PAST+3SG 

The details of relativization in Turkish are investigated by Barker et.al. [I] 
and Giingordii [6]. 



Chapter 4 



Generation of Turkish Sentences 



This chapter describes the generation of Turkish sentences. In the first section, 
we explain the architecture of our generator. Then, we present a comparison of 
our work with other related works. 

4.1 The Architecture of the Generator 
4.1.1 Approach 

In order to generate Turkish sentences of varying complexity, we have designed 
a recursively structured finite state machine which can also handle the changes 
in constituent order. Our implementation environment is the GenKit system 
[22], developed at Carnegie Mellon University-Center for Machine Translation. 
Morphological realization has been implemented using an external morphological 
analysis/generation component, developed using XEROX Two Level Tools, which 
performs concrete morpheme selection and handles morphographemic processes. 

The generation process gets as input a feature structure representing the con- 
tent of the sentence where all the lexical choices have been made, then produces 
as output the surface form of the sentence. The feature structures for sentences 
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are represented using a case-frame representation. The fact that sentential argu- 
ments of verbs adhere to the same morphosyntactic constraints as the nominal 
arguments enables a nice recursive embedding of case-frames of similar general 
structure to be used to represent sentential arguments. 



4.1.2 Simple Sentences 



In this section, we will explain the generation of predicative, existential, and 
attributive sentences. Our input feature structures and finite state machines 
for giving the outlines of predicative, existential, and attributive sentences differ 
slightly. Therefore, for each kind of simple sentence, we present the case-frame 
and the corresponding finite state machine. 



Predicative Sentences 



We use the following general case-frame feature structure to encode the contents 
of a predicative sentence: 



S-FORM 

CLAUSE-TYPE 

VOICE 

SPEECH-ACT 



QUES 



VERB 



ARGUMENTS 



ADJUNCTS 



CONTROL 



infinitive / adverbial/participle / finite 
predicative 

active/ reflexive / reciprocal / passive / causative 
imperative / optative / necessitative/ wisli/ 
interrogative/ declarative 

TYPE yes-no/wh 

CONST list-of( subject /dir-obj/etc.) 

ROOT verb 

POLARITY negative/positive 

TENSE present /past /future 

ASPECT progressive/habitual/etc. 

MODALITY potentiality 

SUBJECT 
DIR-OBJ 



SOURCE 

GOAL 

LOCATION 

BENEFICIARY 

INSTRUMENT 

VALUE 



c-name 
c-name 
c-name 
c-name 
c-name 
c-name 
c-name 
c-name 



TIME 
PLACE 
MANNER 
PATH 



c-name 
c-name 
c-name 
c-name 



DURATION c-name 



IS 



ES 



TOPIC constituent 
FOCUS constituent 
BACKGR constituent 

EVEN const ituent/poss-const ituent 
TOO constituent/poss-constituent 
QUES constituent/poss-constituent 



We use the information given in the CONTROL | IS feature to guide our gram- 
mar in generating the appropriate sentential constituent order. This information 
is exploited by a right linear grammar (recursively structured nevertheless) to 
generate the proper order of constituents at every sentential level (including em- 
bedded sentential clauses with their own information structure). The simplified 



outline of this right linear grammar is given as a finite state machine. The recur- 
sive behavior of this finite state machine comes from the fact that the individual 
argument or adjunct constituents can also embed sentential clauses. The details 
of the feature structures for sentential clauses are very similar to the structure 
for the case-frame. Thus, when an argument or adjunct, which is a sentential 
clause, is to be realized, the clause is recursively generated by using the same set 
of transitions. 

The outline of a grammar for generating predicative sentences can be given by 
a recursive finite state machine (FSM). Before proceeding to this FSM, in order 
to ease the understanding of it, a simpler example in a simpler domain will be 
given. In this simple domain, the only arguments of the verb are subject and 
direct object, there is no adjunct, and the only constituent of the information 
structure is the focus. The default word order is: 

subject, direct object, verb. 

The constituent which is to be emphasized, the focus, moves to the immediately 
preverbal position. So, in the following example, the first sentence is in the de- 
fault order, and in the second sentence, the subject, 'Ali', is the focus: 

(83) a. Ali topu atti. (Default Ord.) 

Ali ball+ACC throw+PAST+3SG 

'Ali threw the ball. ' 

b. Topu Ali atti. ('Ali' is focus) 

ball+ACC Ali throw+PAST+3SG 

'It was Ali, who threw the ball. ' 

The outline of a grammar for this simple domain can be given with the finite 
state machine in Figure 4.1. 

In this finite state machine, the transition from the initial state to state 1, 
labeled Subject generates the subject, if it is defined and it is not the focus. 
Otherwise, a NIL transition is done, which generates an empty string. Then, the 



SUBJ 




Focus=*undef* or 
Focus=Dir-obj or 
Focus=Verb 



Figure 4.1: The finite state machine for giving the outiine of a grammar for the 
simple domain. 

transition from state 1 to state 2, labeled Dir-obj, generates the direct object, if 
it is defined. The transition from state 2 to state 3, labeled Subject, generates 
the subject if it is the focus. Finally, the verb is generated with the transition 
from state 3 to the final state, labeled verb. 

Fxtending this simple domain to cover all the arguments of the verb, adjuncts, 
and information structure constituents, we get the finite state machine given in 
Figures 4.2 and 4.3 to give an outline of our grammar for generating predicative 
sentences. 

In this finite state machine, transitions are labeled by constraints and con- 
stituents (shown in bold face along a transition arc) which are generated when 
those constraints are satisfied. If any transition has a NIL label, then no surface 
form is generated for that transition. The transitions from state to state 1 
generate the constituent which is the topic. If the topic is not defined, then the 
NIL transition is taken, which generates an empty string. The other transitions, 
from state 1 to state 14 generate the constituents in the default order. The tran- 
sitions from state 14 to 15 generate the constituent which is the focus, and the 
constituent which is the background is generated by the transitions from state 17 
to the final state. 



NIL 

Topic=Path or 
Focus=Path or 
Backgromd=Path or 
Path=*undefined* 



NIL NIL 

T(ipk=DuratUm or Topk=Ma!mer or 
Focus=Duralionor Focus=Mamer or 
Backgromd=Duration Backgromd=Manncr 
or Mmner=*mdejined* 
Duration= *mdefmed* 



Subject 

'ckground= 
Time 

'dckground^Tim 
BackgrSi^ 




\Verb^ 
Focus=Verb 



X^__NIL^ 
Background= *undefined* 
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When the following case-frame is sent to the generator, since it has no CON- 
TROL feature, and so no information structure, the constituents are generated in 
the default order, which is the sentence below the case-frame. 



S-FORM finite 

CLAUSE-TYPE predicative 

VOICE active 
SPEECH-ACT 



VERB 



ARGUMENTS 



ADJUNCTS 



declarative 

ROOT 

SENSE 

TENSE 



#birak 

positive 
past 



ASPECT perfect 
SUBJECT jAhmet 
DIR-OBJ jkitap 
LOCATION masa 

TIMEUiin' 



(84) Ahmet diin kitabi masada birakti. 

Ahmet yesterday book+ACC table+LOC leave+PAST+3SG 

'Ahmet left the hook on the table yesterday. ' 

Considering the content, the following case-frame is the same as the previ- 
ous one. However, this case-frame has a CONTROL feature, so an information 
structure, which expresses that expression of time is topic, subject is focus, and 
location is background. Therefore, the sentence generated when this case-frame 
is sent to the generator is not in the default order, which is the sentence below 
this case-frame. Figure 4.4 demonstrates the transitions done on the finite state 
machine to generate this sentence. 
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S-FORM finite 
CLAUSE-TYPE predicative 



VOICE 

SPEECH-ACT 



VERB 



ARGUMENTS 



ADJUNCTS 



CONTROL 



active 
declarative 

ROOT 

SENSE 

TENSE 



#birak 

positive 
past 



ASPECT perfect 
SUBJECT jAhmet 
DIR-OBJ jkitap 
LOCATION ^ 



masa 



TIME diin 



IS 



TOPIC time 
FOCUS subject 
BACKGROUND location 



(85) Diin kitabi Ahmet birakti masada. 

yesterday book+ACC Ahmet leave+PAST+3SG table+LOC 

'It was Ahmet who left the book yesterday on the table. ' 



dUn 



To^fc= *undefined* 



Subject \ 

' ^opic—Subj&ct 
Time 




Location _ Ti>^}.U =-Lr}< 
" ~- ~~ JnstriinienJ - - ' 
■ ^ Topic=lnstrument , 
"~ - ^ Value ' 
Topic— Value 

^ ~ - Path- ' ' 
Topic=Path 

\ ^ ^ ^ J)uration, - ' 
Topic=Duration ' 

\ ^--^Mannec' 
\ T<>pic= Manner / 

" - Verb , - ' ' 
Topic=Verb 



Topic=Subject or Tapic=Titne ar Topic=Place or 

FoCUS=SubjeCt or Fiiciis=Tiine ar Feicus=Plac/: or 

BackgrouruI=Subject Backgraiiiul=Tiine or Biii:kground=Flac. 

Subject= *undefined* 



Toplc=Dir-obJ or Toplc=Beneftciary or Toplc=Source or 

Focus=Dir-obJ or Focus=Beneftciary or Focus=Source or 

Backgroiirid=Dir-obJ or Background=BenEflciary BackgrouHd=Sour 
Dir-obJ=i-d^niu 



Dir-obj = *uruieftned* 



Ben^ciary= *undeflned* Source= *uiuleflned' 



Toplc=Goal or 
Focus=Goal or 
Background=Goal o 
Goal= "'undefined* 



Topic=Locatlon or Topic=Instrurnent o 

Focus=Localion or Focus=InstruiHeni c 

Backgrourui=Locarion Backgroiaid=Instrur, 

Location = *iaideflned* 



ahmet 

/' NIL 

Focus= ^undefined* 



masada 

Subject 

Uackground=Subject 
Time 

Background ^Ti^iic ^ 

3ack};i(ju iui —Place 




" J>urati(HL ' " 

Focus=i>uration ' 

^ ^ ^ Mamiei: ' ,' 
Focus=Manner, ' 

^v^Verb 
Focus=Verb 



-Path- 

. ~v Background— Path^' ■ 

\ ^ ^ ^ J>uratiim- ' / 
^ ^Background=Duration ' 

^ Mamiei: ' ,' 
^ Sackground=MannJr 

Background= ^undefined* 



Existential Sentences 



We use the following general case-frame feature structure to encode the contents 
of an existential sentence: 



S-FORM 

CLAUSE-TYPE 

VOICE 

SPEECH-ACT 



QUES 



VERB 

ARGUMENTS 
ADJUNCTS 



CONTROL 



infinitive/adverbial /participle /finite 
existential 

active /reflexive /reciprocal /passive/causative 
imperative /optative /necessitative /wish / 
interrogative/declarative 

TYPE yes-no/wh 
CONST list-of(subject/dir-obj/etc.) 

ROOT verb 

POLARITY negative/positive 

TENSE present/past /future 

ASPECT progressive/habitual/etc 

MODALITY potentiality 

SUBJECT c-name 

TIME c-name 
PLACE c-name 

TOPIC constituent 
IS FOCUS constituent 
BACKGR constituent 



ES 



EVEN const ituent/poss-constituent 
TOO constituent/poss-constituent 
QUES constituent/poss-constituent 



The value of the CONTROL | IS feature is again the information structure of the 
sentence. This case-frame differs from predicative sentences considering only ar- 
guments and adjuncts. Figure 4.5 gives an outline of our grammar for generating 
existential sentences. 



time 



3 

cm' 

Hi 
Or 



o 

B' 

Hi 



o 




NIL 

Topic = place or 
Background = place or 
place = *undefined* 



NIL 

Sub] = "^undefined* 

Focus = *undefined* 



BG = *undefined* 



Attributive Sentences 

We use the following general case-frame feature structure to encode the contents 
of an attributive sentence: 



S-FORM 

CLAUSE-TYPE 

VOICE 

SPEECH-ACT 



QUES 



VERB 



infinitive/adverbial /participle /finite 
attributive 

active /reflexive /reciprocal /passive/causative 
imperative /optative /necessitative /wish / 
interrogative/declarative 

TYPE yes-no/wh 
CONST list-of(subject/dir-obj/etc.) 

ROOT verb 

POLARITY negative/positive 

TENSE present/past /future 

ASPECT progressive/habitual/etc 

MODALITY potentiality 

SUBJECT c-name 
PRED-PROPERTY c-name 

TIME c-name 
PLACE c-name 

TOPIC constituent 
IS FOCUS constituent 
BACKGR constituent 

EVEN const ituent/poss-constituent 
ES TOO constituent/poss-constituent 
QUES constituent/poss-constituent 



Figure 4.6 presents an outline of our grammar for generating attributive sen- 
tences. The CONTROL | IS feature in the case-frame, the information structure, 
is used to generate the appropriate surface order. 



ARGUMENTS 



ADJUNCTS 



CONTROL 



3 

cm' 

Hi 



B 

o 

B' 

Hi 



o 



Subject 

Topic = Subject 



Time 




NIL 

Topic = *undefined* 



NIL 

Subject = *undefined* or 
Topic = Subject or 
Focus = Subject or 
Background = Subject 



Time = *undefined* or 
Topic = time or 
Focus = time or 



Subject 

Backervund = Subject 




NIL 

Place = *undefined* or 
Topic = Place or 
Focus = Place or 
Background = Place 



NIL 

Focus = *undefined* 



NIL 

Background = *undefined* 



Di 



4.1.3 Complex Sentences 



Complex sentences are combinations of simple sentences (or complex sentences 
themselves) which are linked by either conjoining or various relationships like 
conditional dependence, cause-result, etc. The generator works on a feature 
structure representing a complex sentence which may be in one of the following 
forms: 



• a simple sentence. In this case the sentence has the case-frame as its feature 
structure. 

• a series of simple or complex sentences connected by coordinating or brack- 
eting conjunctions. Such sentences have feature structures which have the 
individual case-frames as the values of their ELEMENTS features: 

TYPE conj 
CONJ and/or/etc. 
ELEMENTS list-of( complex-sentence) 

• sentences linked with a certain relationship. Such sentences have the feature 
structure: 

TYPE linked 
LINK-RELATION rel 
ARGl complex-sentence 
ARG2 complex-sentence 

The case-frame of the following complex sentence, which is formed from two 
simple sentences linked by cause-result relationship, is given below it: 

(86) Sen geldigin icin o gitti. 

you come+PART+P2SG because he go+PAST+3SG 

'He went because you came. ' 



TYPE 

LINK-RELATION 
ARGl 

ARG2 

4.1.4 Generating Noun Phrases 

This subsection describes the generation of Turkish noun phrases and the feature 
structures (our input) that are used to denote them. 

The Design of a Semantic Representation for Turkish Noun Phrases 

1. The Semantic Features for Common Nouns The basic semantic infor- 
mation for concepts, needed in the generation of Turkish noun phrases are (with 
possible values are given in the parentheses): 

temporal (-|-/-), 

container (-|-/-), 

measure (-|-/-), 

countable (-|-/-), 

material (-|-/-). 

These are needed only if the head is a common noun. These are available in the 
lexicon entries associated with the noun. 

This semantic information is percolated from the head of the noun phrase to 
the whole noun phrase. Some of these features are not given in the input, but 
they can be obtained from the lexicon when needed. The usage of these features 
can be given as: 

• The temporal feature is needed when determining the case of the noun 
phrase which expresses the location information (the specifying relation). 



linked 
cause-result 

CASE-FRAME FOR 
THE SENTENCE "Sen geldin.' 

CASE-FRAME FOR 
THE SENTENCE "0 gitti." 



• The head of the noun phrase specifying a quantitative modifier of type 
container-full (explained in the following sections) should have the property 
of being a container, though this may be relaxed. The container feature is 
needed to ensure this, when necessary. 

• In a similar way, the head of the noun phrase specifying a quantitative 
modifier of type measure (explained in the following sections) should have 
the property of being a unit, so measure feature is used to ensure this. 
This feature is also needed to ensure the absence of the word 'adet', 'tane', 
or 'parca' between a cardinal and a unit. 

• The countability information is needed when comparing the countability 
of the head with that required by the quantifier. 

• The material feature is used in a similar way with the container and measure 
features. The head of the noun phrase specifying a modifying relation (which 
is a noun phrase followed by the suffix +DAn) should have the property of 
being a material (although this may also be relaxed). It is needed to ensure 
this. 

The generator needs the semantic information above for the whole noun phrase 
and information about its constituents, in order to generate the noun phrase. 
Besides these, it also needs emphasis information, to determine the word order. In 
the absence of any emphasis information, the generator generates the constituents 
in the default order. 

In the following subsections, the feature structures for denoting noun phrases 
and their constituents are given. 
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Proposed Feature Structures for the Noun Phrase 



We propose the following feature structure to describe the semantic structure of 
a noun phrase: 



c-name 



REFERENT 



ARG 



CONTROL 



list-of ( c-name-no-spec-no-mod) 
b-con 

DROP +/- (default - 



MODIFIER 



ORDINAL 



CLASSIFIER c-name-no-spec-qual 
ROLES role-type 

MOD-REL list-of (m-^?/pe) 

ORDER ilk/sonuncu/birinci/etc. 
INTENSIFIER +/- (default -) 

QUAN-MOD q-type 
QUALITATIVE list-of( S-prop) 

CONTROL EMPHASIS quantitative/ qualitative 

SET-SPEC list-of (c-name) 

RELATION dair/ait /location/etc. 

SPEC-REL 

ARGUMENT list-of( C-name) 

list-of ((i-^?/pe) 

QUANTIFIER her/bazi/etc. 

DEFINITE +/- 

REFERENTIAL +/- 

SPECIFIC +/- 

ARGUMENT list-of( C-name) 

DROP +/- (default -) 
MOVE +/- (default -) 



SPECIFIER 



DEM 



)UAN 



POSSESSOR 



CONTROL 



C-name is the type of the feature structure which denotes a noun phrase, c- 
name has a number of subtypes depending on the constraints on some of its 
features. The hierarchy of structures which are constrained forms of c-name is 



DO 



c-name 



c-name-no-spec 
c-name-no-spec-qual 
c-name-no-spec-no-mod 
c-name-no-spec-no-mod-no-class 

Figure 4.7: The hierarchy of subtypes of c-name 
as in Figure 4.7. The subtypes of c-name are defined as follows: 



1. If a noun phrase is not specified by any specifiers, then it has a structure of 
type c-name-no-spec. This structure has the following constraints on it: 



c-name-no-spec 



SPECIFIER NIL 
POSSESSOR NIL 



C-name 

2. If a noun phrase is not specified by any specifiers, and furthermore it is 
not modified by any modifiers except the qualitative modifier, then it has 
a structure of type c-name-no-spec-qual. In this case, it has the constraints 
on it, which can be shown in the following feature structure: 



c-name-no-spec-qual 



MODIFIER 



MOD-REL NIL 
ORDINAL NIL 

SPECIFIER NIL 
POSSESSOR NIL 



C-name l 

3. If a noun phrase is not specified by any specifiers, and not modified by 
any modifiers, then it has a structure of type c-name-no-spec-no-mod. This 
structure has the constraints on it, which can be given with the following 
feature structure: 



c-name-no-spec-no-mod 



c-name 



MODIFIER NIL 
SPECIFIER NIL 
POSSESSOR NIL 



4. If the structure of a noun phrase has only the REFERENT feature, and has a 
h-con (basic-concept, explained in detail in the following sections) structure 
as its value, then the type of its structure is c-name-no-spec-no-mod-no- 
class. The constraints on this structure can be given as below: 



c-name-no-spec-no-mod-no-class 



c-name 



REFERENT 



ARG b- 



con 



CLASSIFIER NIL 

MODIFIER NIL 

SPECIFIER NIL 

POSSESSOR NIL 



The value of MODIFIER | CONTROL | EMPHASIS feature in a c-name structure 
is used to determine the order of quantitative and qualitative modifiers in the 
surface form. The value of this feature is closer to the head of the noun phrase 
in the surface form. 



The value of a feature may be a list of structures, as explained in Chapter 3. 
This is shown as list-of(str) in the attribute- value matrix above, which is equiv- 
alent to the following feature structure: 



list-of (str) 



ELEMENTS (^str, str, ... yStr 
CONJ and/or/none 



The feature structures for the modifiers, specifiers and the classifier of the noun 
phrase are: 



1. Set Specifier The set specifier is also a noun phrase and the SET-SPEC 
feature has a structure of type c-name as value. Therefore, it has a structure 
hke: 

SET-SPEC hst-of( C-name) 

2. Possessor The possessor is a list of the structures of type c-name. Some- 
times, the possessor can be eliminated from the surface form as explained in 



Chapter 3, then some additional information regarding whether it can be omitted 
on the surface form may also be given. In a c-name structure, this information is 
given with the value of the POSSESSOR | CONTROL | DROP feature. If this feature 
has the value -, then the possessor can not be omitted, otherwise it is omitted. 
If no such information is given, then by default, this means that the possessor 
can not be omitted. 

Furthermore, the possessor may change its position. The information about 
the movement of the possessor is given in a c-name structure with the POSSES- 
SOR I CONTROL I MOVE feature. If this feature has + as value, then the possessor 
may move to a position behind the head in the surface form. So, MOVE can be 
used to encode possessor scrambling, when needed. The structure of possessor is 
given as the value of the feature POSS, and looks like: 



POSSESSOR 



ARGUMENT list-of{ C-name) 

DROP +/- (default 
MOVE +/- (default 



CONTROL 



3. Specifying Relation If specifying relation is present, it has the structure: 



SPEC-REL 



RELATION dair /ait /location 
ARGUMENT list- oi{ C-name) 



The RELATION feature gives the relation of the argument noun phrase (the value 
of the ARGUMENT feature) with the head. If the relation is given by a post- 
position, then the RELATION feature has that postposition as its value. The 
RELATION feature has as value 'location', if the specifying relation mentions a 
temporal or spatial location. In such the information whether the argu- 

ment noun phrases should be in locative or nominative case is obtained from the 
value of SEM I TEMPORAL features of their referents. 



4. Demonstrative Specifier The demonstrative specifier is a list of the Turk- 
ish demonstrative specifiers. If it is present, it has a structure like the following: 
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DEM list-of{ d-type) 



where the structure of type d-type is the following: 



d-type 



DEMONS bu/su/o 



5. Quantifier The quantifier is represented by the following structure: 



QUANTIFIER her/bazi/etc. 

DEFINITE +/- 

QUAN 

REFERENTIAL +/- 

SPECIFIC +/- 

The QUANTIFIER feature has as value the root of the quantifier. If the quan- 
tifier feature has NIL as value, then the DEFINITE, REFERENTIAL, and SPECIFIC 
features determine the presence of the indefinite article 'bir' (a/an). Furthermore, 
these features, together with the role of the noun phrase, determine the case and 
the position of noun phrase in the sentence. 



6. Modifying Relation A noun phrase may have more than one modifying 
relations, so the MOD-REL feature of a concept has as value a list of modifying 
relations. The feature structures which are the elements of this list are of m-type^ 
which has the following structure: 

RELATION gibi /kadar / made-of / with /etc. 
ARGUMENT list- oi{c- name) 

m-type^ 

If an element of the modifying relation list is a postpositional phrase, where 
the postposition gives the relation of some other concepts with the head, the RE- 
LATION feature has that postposition as its value. If an element of the modifying 
relation list is an adjectival phrase made from a noun phrase by the suffix: 



• +DAn, then the value of RELATION feature is 'made-of. The argument noun 



phrases must have the property of being a material, but this may also be 
relaxed. 

• +1H, then the value of RELATION feature is 'with'. The argument has on it 
the constraint that it should be a list of structures of type c-name-no-spec. 

• +sHz, then the value of RELATION feature is 'without'. The argument has 
on it the constraint that it should be a list of structures of type c-name-no- 
spec-qual. 

• +DA, then the value of RELATION feature is 'made-on'. The argument has 
on it the constraint that it should be a list of structures of type c-name-no- 
spec-qual. 

• +lHk, then the value of RELATION feature is 'of. However, in this case the 
value of the ARGUMENT feature is not a list of structures of noun phrases, but 
it is the structure of a quantitative modifier of type measure (the structure 
of a quantitative modifier of type measure will be explained when explain- 
ing quantitative modifiers). But the value of the APPROX feature in this 
structure should be -, because the quantitative modifier should not have the 
word 'civarmda' as the head. 



7. Ordinal The ordinal also has a structure similar to the demonstrative spec- 
ifier or quantifier. When it is present, it has the structure: 



ORDINAL 



ORDER ilk/sonuncu/birinci/etc. 
INTENSIFIER -K/- (default -) 



8. Quantitative Modifier The QUAN-MOD feature contains the quantitative 
modifier. In Turkish, quantity information can be expressed in several forms, as 
described in Chapter 3. The type of the feature structure (q-type) which gives 
the quantity information can be one of number, measure, container-full or fuzzy- 
quantity^ with the following structures: 



1. If the quantity is expressed by a cardinal alone, as in the following example. 



(87) dort elma 
four apple 

'four apples' 

then the feature structure corresponding to the quantitative modifier of this 
noun phrase will be as follows: 



number 



LOW 4 
HIGH NIL 

FORMAL-CARD - (default 
CONTROL FORMAL-LOW - (default 
FORMAL-HIGH - (default 



Here, the value of the QUAN-MOD | CONTROL | FORMAL-CARD feature deter- 
mines whether the cardinal should be followed by one of the words: 'adet', 
'tane', or 'parca'. In the absence of this information, the cardinal is not fol- 
lowed by any of these words. The values of the features QUAN-MOD | CON- 
TROL I FORMAL-LOW and QUAN-MOD | CONTROL | FORMAL-HIGH determine 
the presence of the words 'en az' (at least) and 'en cok' (at most), respec- 
tively. These words are used in order to specify limits. 

2. If the quantity is expressed by a range of cardinals, as in the following 
example, 

(88) iki lie elma 
two three apple 

'2 to 3 apples' 

then the feature structure corresponding to the quantitative modifier of this 
noun phrase will be like: 



number 



LOW 
HIGH 

CONTROL 



FORMAL-CARD 

FORMAL-LOW 

FORMAL-HIGH 



- (default 

- (default 

- (default 



Here the roles of QUAN-MOD | CONTROL | FORMAL-CARD, QUAN-MOD | CON- 
TROL I FORMAL-LOW and QUAN-MOD | CONTROL | FORMAL-HIGH features 
are the same as in the case of cardinal. If the value of QUAN-MOD | CON- 
TROL I FORMAL-LOW feature was -|- then (89a) would be generated from 
the above feature structure, whereas if the values of QUAN-MOD | CON- 
TROL I FORMAL-LOW and QUAN-MOD | CONTROL | FORMAL-HIGH features 
were both -|-, then (89b) would be generated from it: 

(89) a. en az iki tie 

most less two three 

'at least two to three ' 

b. en az iki en cok tie 
most less two most much three 

'at least two, at most three ' 

3. If the quantity is expressed by a noun phrase whose head is a measure noun 
as in example (53c) of Chapter 3, then the corresponding feature structure 
will be: 



measure 



QUANTITY number 

UNIT c-name-no-spec-no-mod-no-class 
APPROX +/- (default -) 



The value of the QUAN-MOD | APPROX feature, which is by default - in the 
absence of this feature, determines whether the word 'civarmda' (meaning 
'about') should be present. 

So, the MODIFIER | QUAN-MOD feature of the noun phrase 



(90) a. bir iki kilo elma 
one two kilo apple 

'1 to 2 kilos of apples ' 

will have as its value, the following feature structure: 



measure 



QUANTITY 

number 

UNIT 

c-name 



LOW 1 
HIGH 2 



REFERENT 



ARG 
h-con 



CONCEPT #kilo 



SEM 



MEASURE + 



4. If the quantity is expressed by a noun phrase, whose head is a container, as 
in example (53a) of Chapter 3, the feature structure which is the value of 
the quantity feature, will be similar to the above two. The only difference 
will be the type of the feature structure, this time it will be container-full 
instead of measure, and there will not be a QUAN-MOD | APPROX feature. 
So, the corresponding feature structure will be as follows: 



container-full 



QUANTITY number 

UNIT c-name-no-spec-qual 



5. If the quantity is expressed by a fuzzy quantity, as in example (52) of Chap- 
ter 3, then the corresponding part of the feature structure of such a noun 
phrase will be like: 



fuzzy-quantity 



F-QUAN cok 



CONTROL 



FORMAL-QUANTITY +/- (default -) 



Some of these adjectives may also be followed by one of the words 'miktarda' 
(with uncountable heads) or 'sayida' (with countable heads). The value of 
the feature CONTROL | FORMAL-QUANTITY is used when determining the 
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presence of the word 'miktarda' or 'sayida'. If it is +, then one of these 
words should be present. 



9. Qualitative Modifier The head of a noun phrase may be modified by more 
than one qualitative modifiers. Therefore the QUALITATIVE feature of a concept 
is a list of individual qualitative modifiers. The elements of this list are of type 
simple property (i.e. adjective), which has the following structure: 



s-prop 



P-NAME 
INTENSIFIER 



basic-property 

i-type 



In order to be modified by an intensifier, the basic-property should be gradable 
(i.e. it should have the value + for its GRADABLE feature). The intensifiers of 
the simple properties can be adverbs mentioning the degree of that property, as 
in example (59a) (Chapter 3), or they can be postpositional phrases comparing 
that property with another, as in example (59b) (Chapter 3). Therefore, the type 
of the structure for the intensifier [i-type) can be: degree or p-comparative. These 
two have the following structures, respectively: 



degree 



DEGREE cok/en/dah /etc. 



p-comparative 



COMPARATOR daha/kadar 
ARG c-name 



The intensifier of type degree can be 'acik' (light) or 'koyu' (dark), if it is modi- 
fying a color (i.e. the basic-property has the value + for its COLOR feature). 



10. The Classifier and the Head The head of a noun phrase is given as 
the value of the REFERENT feature, and the classifier is given as the value of the 
CLASSIFIER feature, which has the structure of c-name with certain constraints 
on it. Therefore, the REFERENT and CLASSIFIER features have the following 
structures: 
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c-name 



REFERENT 



ARG 



CONTROL 



list-of^c-name-no-spec-no-mod) 
b-con 



DROP +/- (default -) 
CLASSIFIER c-name-no-spec-qual 



The referent may either have a hst of c-name-no-spec-no-mod structures or a b- 
con structure. The structure of a typical b-con is: 



CONCEPT #k6pek 



b- 



con 



SEM 



TEMPORAL 


+ / 


CONTAINER 


+ / 


MEASURE 


+ / 


COUNTABLE 


+ / 


MATERIAL 


+ / 



The value of the REFERENT | CONTROL | DROP feature determines whether 
the head will drop or not. If this feature has a value +, then the head will drop, 
as explained in Chapter 3. This feature has a value - by default. 

The following is an example to a complex feature structure: 



REFERENT 



CLASSIFIER 



REFERENT 



CLASSIFIER 

c-name 



ARG I CONCEPT #oran 

h-con 



REFERENT 



ARG OF TYPE b-con I 
WITH CONCEPT #komisyon | 



REFERENT 



CLASSIFIER 

c-name 



ARG CONCEPT #kart 

b-conl J 



REFERENT 



ARG I CONCEPT #kredi 

b-con 



which is the feature structure of the noun phrase: 



(91) a. kredi karti komisyon orani 

credit card+P3SG commission rate+P3SG 

'credit card commission rate ' 

and the feature structure of the noun phrase: 



(92) a. komisyon orani 

commission rate+P3SG 

'commission rate ' 



can be given with the following feature structure: 



c-name 



REFERENT 

CLASSIFIER 

C-name 



ARG 



b-con 



CONCEPT #oran 



REFERENT 



ARG 



b- 



coni 



CONCEPT #komisyon 



Roles 



The sentential modifiers of the noun phrase and the noun phrases which are 
sentential clauses are linked to our feature structure denoting noun phrases, by 
the ROLES feature. 

The sentential modifiers of the head are gapped sentential clauses, where the 
head is the filler of the gap. Therefore, if the head has a sentential modifier, the 
ROLES feature in the feature structure for this noun phrase has as value a feature 
structure which can be given as follows: 

ROLE agent /patient /theme/etc. 
ARG case-frame 

Here, the ROLE feature has as value the role of the head, the gapped constituent 
in the gapped sentential clause and the ARG feature has the case-frame for the 
sentential clause. The case-frames for sentential clauses are very similar to the 
case-frames for sentences. 



The noun phrases which are full sentential clauses are acts, facts and adver- 
bials. If the noun phrase is a full sentential clause, then the feature structure 
denoting it has only the ROLES feature with the following feature structure as 
value: 

TYPE ind-act/def-act/fact/etc. 
ARG case-frame 

The TYPE feature gives the type of the sentential clause and the ARG feature 
contains the case-frame for the sentential clause. 

The form of the verb of the sentential clause is determined using the ROLE 
feature with sentential modifiers and TYPE feature with full sentential clauses 
which act as noun phrases. 



Some Example Feature Structures 

The following are some examples for the feature structure of a noun phrase: 



• yedi adamdan ikisi 
seven man+LOC two+P3SG 

'two of the seven men' 



REFERENT 



MODIFIER 



ARG 



h-con 

CONTROL pROP 
QUAN-MOD 



^CONCEPT #adam 



LOW 2 
HIGH NIL 



SPECIFIER 



SET-SPEC 



REFERENT 



MODIFIER 



ARG CONCEPT #adam 

h-r.nnX i 



h-con 

QUAN-MOD 

number 



LOW 7 
HIGH NIL 



c-name 



• at kadar biiyiik bir kopek 
horse as big a dog 

'a dog as big as a horse ' 



REFERENT 



SPECIFIER 



CONCEPT #kopek 
I L J 



QUALITATIVE 



#buyuk 
COMPARATOR kadar 



INTENSIFIER 



QUAN 



ri-comriarative 
s-prop L ^ ^ 

QUANTIFIER NIL 
DEFINITE 
REFERENTIAL + 
SPECIFIC 



A c-name STRUCTURE 
WITH A REFERENT OF 
TYPE b-con WITH 
CONCEPT #at 



Problems in Generating Noun Phrases 



The surface form of a noun phrase with the following feature structure: 



REFERENT 



MODIFIER 



ARG I CONCEPT #adam 

b-con 



QUAN-MOD 

number 



with 



LOW 2 
HIGH NIL 



ARGUMENT 



REFERENT 



ARGUMENT OF TYPE b-con 
WITH CONCEPT #kalem 



can be both of the below phrases, depending on the emphasis information: 



(95) a. iki, kalemli adam 
two pencil+WITH man 

'two men with a pencil' 
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b. kalemli iki adam 

pencil+WITH two man 

'two men with a pencil' 

But if no comma is generated in the first surface form, it will be ambiguous, 
because it is the surface form of the following feature structure also: 



REFERENT 



MODIFIER 



ARG I CONCEPT #adam 

b-con 



with 



ARGUMENT 



REFERENT 



I ARGUMENT OF TYPE b-con I 
I WITH CONCEPT #kalem I 

I QUANTITATIVE MODIFIER OF TYPE | 
I number WITH LOW 2 I 



Therefore, if the generator receives no emphasis information, it always gener- 
ates the second surface form, for the first feature structure, because generating 
a comma between the modifying relation and the quantitative modifier, in order 
to eliminate this ambiguity, is not very good stylistically. 

A similar ambiguity is also present in the surface form corresponding to the 
following feature structure. This phrase has a qualitative modifier and a modi- 
fying relation: 



c-name 



REFERENT 



MODIFIER 



ARG 



b-con 



CONCEPT #masa 



QUALITATIVE 

s-prop 



P-NAME #beyaz 



MOD-REL 



made-of 



ARGUMENT < 



A STRUCTURE OF 
TYPE C-name WITH 
REFERENT #tahta 



And the corresponding surface forms are: 



(96) a. beyaz, tahtadan masa 
white wood+ABL table 

'white table made of wood' 

b. tahtadan beyaz masa 
wood+ABL white table 

'white table made of wood' 



The first of these surface forms can also be ambiguous if the comma is not present 
in the surface form, because it also corresponds to the following feature structure: 



REFERENT 



CONCEPT #masa 



MODIFIER 



MOD-REL 



made-of 



ARG UMENT 



A STRUCTURE OF TYPE c-name 
WITH REFERENT #tahta AND THE 
< REFERENT IS MODIFIED BY A 

QUALITATIVE MODIFIER OF TYPE s-'pro'p 
WITH P-NAME #beyaz 



The generator should generate the surface forms that are not ambiguous. In 
order to realize this, 



• If there is no emphasis information, the set of modifying relations precede 
the quantitative modifier in the surface form, 

• If there is emphasis information, there will be a comma between each element 
of the set of modifying relations and between the modifying relations and 
quantitative modifier. 



4.2 Grammar Architecture 



Our generation grammar is written in a formalism called Pseudo Unification 
Grammar implemented by the GenKit generation system [22], developed at Car- 
negie Mellon University-Center for Machine Translation. In the following subsec- 
tions, information about GenKit, and then some example rules of our grammar 
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will be given. 

4.2.1 GenKit 

Generation Kit (GenKit) is a system which compiles a grammar into a sen- 
tence generation program. The grammar formalism used by GenKit is also called 
Pseudo Unification Grammar [22]. Each rule of the grammar consists of a context- 
free phrase structure description and a set of feature constraint equations^ which 
are used to express constraints on feature values. Non-terminals in the phrase 
structure part of a rule are referenced as xO , . . . ,xn in the equations, where xO 
corresponds to the non-terminal in the left hand side, and xn is the n^^ non- 
terminal in the right hand side. 

To implement the sentence level generator (described by the finite state ma- 
chine presented earlier), we use rules of the form: 

S^' ^ XP Sj 

where the and Sj denote some state in the finite state machine and the XP 
denotes the constituent to be realized while taking the transition between states 
Si and Sj, labeled XP. By the feature constraint equations, the corresponding part 
of the feature structure which was assigned to previously, is assigned to XP, 
and the remaining is assigned to Sj. If this XP corresponds to a sentential clause, 
the same set of rules are recursively applied. This is a variation of the method 
suggested by Takeda et al. [21]. By this kind of rules, there is no need to write 
a separate rule for each possible constituent order. 

In generation, non-determinism, producing multiple surface forms for a given 
input, is a serious problem. If there is no style-related information (an information 
structure) in the input, our generator generates the sentence in a default order. 
If there was no such a default order, non-determinism would have been a big 
problem, because Turkish is a free-constituent order language. For a given input 
of two arguments plus the verb (without no information structure) 6 surface 
sentences (all combinations) might be generated. 



8i 



Since the context-free rules are directly compiled into tables, the performance 
of the system is essentially independent of the number of rules, but depends on 
the complexity of the feature constraint equations (which are compiled into LISP 
code). Currently, our grammar has 273 rules (excluding lexical rules), each with 
very simple constraint checks. Of these 273 rules, 133 are for sentences and 107 
are for noun phrases, and the remaining are for adverbials and verbs. 



4.2.2 Example Rules 

Figure 4.8 demonstrates the transitions from state to state 1 of our finite state 
machine for generating predicative sentences. The following are the simplified 

NIL 




Figure 4.8: Transitions from state to state 1 of our finite state machine for 
generating predicative sentences. 
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forms of our rules for the top three transitions in Figure 4.8: 



( 

((xO control topic) =c *undefined*) 
(xl = xO)) 

) 



(<S> <==> (<Subject> <S1>) 
( 

((xO control topic) =c subject) 
(x2 = xO) 

((x2 arguments subject) = *remove*) 
(xl = (xO arguments subject))) 

) 



(<S> <==> (<Time> <S1>) 
( 

((xO control topic) =c time) 
(x2 = xO) 

((x2 adjuncts time) = *remove*) 
(xl = (xO adjuncts time))) 

) 



The first rule above is for the NIL transition, this transition is done if the topic 
is not defined. The second rule is for the transition labeled Subject, if the topic 
is subject, then this transition is done. In the feature constraint equations, it 
is checked whether the subject is the topic, and if so, the part of the feature 
structure for subject is assigned to <Subject>, and the remaining is assigned to 
<S1>. The third rule is for the transition labeled Time. 

The grammar also has rules for realizing a constituent like <Subject> or 
<Time> (which may eventually call the same rules if the argument is senten- 
tial) and rules like above for traversing the finite state machine from state 1 to 
the final state. 



4.3 Interfacing with Morphology 

As Turkish has complex agglutinative word forms with productive inflectional and 
derivational morphological processes, we handle morphology outside our system 
using the generation component of a full-scale morphological analyzer of Turkish 
[f7]. Within GenKit, we generate relevant abstract morphological features such 
as: 

• agreement, 

• possessive, and 

• case 

markers for nominals and 

• voice, 

• polarity, 

• tense, 

• aspect, 

• mood, and 

• agreement 

markers for verbal forms. This information is properly ordered at the interface 
and sent to the morphological generator, which then: 

f. performs concrete morpheme selection, dictated by the morphotactic con- 
straints and morphophonological context, 

2. handles morphographemic phenomena such as vowel harmony, and vowel 
and consonant ellipsis, and 

3. produces an agglutinative surface form. 
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For example, the following feature structures are the outputs of our generator 
for nominal and verbal forms, respectively. These are sent to the morphological 
generator, which then performs morpheme selections and converts them into the 
intermediate forms below, and produces the agglutinative surface forms from 
these intermediate forms: 



CAT 


NOUN 


ROOT 


kalem 


AGR 


3SG 


POSS 


ISG 


CASE 


GEN 



i 

kalem+0+Hm+Hn 
i 



kalemimin 



CAT 


VERB 


ROOT 


gel 


SENSE 


PCS 


TAMl 


PROGl 


AGR 


ISG 



i 

gel+0+Hyor+Hm 
i 

geliyorum 

4.4 Comparison With Related Work 

There were two studies done on generation of Turkish sentences, previously. The 
first one is the M.Sc. thesis of Colin Dick [3], done in the Department of Ar- 
tificial Intelligence, University of Edinburgh, and the second one is the Ph.D. 
thesis of Beryl Hoffman, at the Computer and Information Science Department 
of University of Pennsylvania. 
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Dick [3] has worked on a classification based language generator for Turk- 
ish. His goal was to generate Turkish sentences of varying complexity, from 
input semantic representations in Penman's Sentence Planning Language (SPL). 
However, his generator is not complete, in that, noun phrase structures in their 
entirety, postpositional phrases, word order variations, and many morphologi- 
cal phenomena are not implemented. Our generator differs from his in various 
aspects: We use a case-frame based input representation which we feel is more 
suitable for languages with free constituent order. Our coverage of the grammar 
is substantially higher than the coverage presented in his thesis and we also use 
a full-scale external morphological generator to deal with complex morphological 
phenomena of agglutinative lexical forms of Turkish, which he has attempted 
embedding into the sentence generator itself. 

Hoffman, in her thesis [8, 9], has used the Multiset-Combinatory Categorial 
Grammar formalism [10], an extension of Combinatory Categorial Grammar to 
handle free word order languages, to develop a generator for Turkish. Her gener- 
ator also uses relevant features of the information structure of the input and can 
handle word order variations within embedded clauses. She can also deal with 
scrambling out of a clause dictated by information structure constraints, as her 
formalism allows this in a very convenient manner. The word order information 
is lexically kept as multisets associated with each verb. She has demonstrated 
the capabilities of her system as a component of a prototype database query 
system. We have been influenced by her approach to incorporate information 
structure in generation, but, since our aim is to build a wide-coverage generator 
for Turkish for use in a machine translation application, we have opted to use 
a simpler formalism and a very robust implementation environment. Our gen- 
erator also differs from her generator, in that: When the information structure 
for sentences (explained in detail in Chapter 3 of this thesis) is not present, we 
generate sentences in a default order (which is also given in Chapter 3), whereas 
her generator generates nothing in such a case. 

A concurrent study was also done by Turgay Korkmaz [14]. He used a func- 
tional linguistic theory called Systemic- Functional grammar, and the FUF text 
generation system to implement a sentence generator for Turkish. His genera- 
tor takes semantic descriptions of sentences and then produces a morphological 



description for each lexical constituent of the sentence. His generator does not 
handle long-distance scramblings, unbounded dependencies and discontinuous 
constituents, but is the first one to use the systemic functional approach in the 
context of Turkish. 



Chapter 5 

Example Outputs and Future 
Work 



In this chapter, we present some example outputs of our generator. In Appendix 
C, some additional examples from computer manuals are also given. 

5.1 Example Outputs 

In this section, some examples to demonstrate the work of our generator are 
given. 

Example 1: 
Input : 

; Adam elmayl kadlna verdi 

; This sentence is in the default order, so there will not be 
; a CONTROL feature. 

((s-form finite) 
(clause-type predicative) 
(voice active) 
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(speech-act declarative) 
(verb 
((root "ver") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr ((number singular) 
(person 3))))))) 

(dir-obj 
( (referent 
((arg 

((concept "elma"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite +))))))) 

(goal 
( (referent 
((arg 

((concept "kadin"))) 
(agr 

((number singular) 
(person 3)))))))))) 

Output : 

[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=NOUN] [ROOT=elma] [AGR=3SG] [POSS=NONE] [CASE=ACC]] — > elmayl 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 



[[CAT=NOUN] [ROOT=kadIn] [AGR=3SG] [POSS=NONE] [CASE=DAT]] — > kadlna 
[[CAT=VERB] [ROOT=ver] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > verdi 
[PERIOD] — > . 



yu 



Example 2: 
Input : 

; Kadlna adam verdi elmayl 

; This sentence is not in the default order, the destination/recipient , 
; ''kadin'', is the topic, the subject, ''adam'', is the focus, 
; the direct object, ''elma'', is the background. 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "ver") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr ((number singular) 
(person 3))))))) 

(dir-obj 
( (referent 
((arg 

((concept "elma"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite +))))))) 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 



(goal 
( (referent 
((arg 

((concept "kadin"))) 
(agr 

((number singular) 
(person 3))))))))) 

(control 
((is 

((topic goal) 
(focus subject) 
(background dir-obj ) ) ) ) ) ) 

Output : 

[[CAT=NOUN] [ROOT=kadIn] [AGR=3SG] [POSS=NONE] [CASE=DAT]] — > kadlna 
[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=VERB] [ROOT=ver] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > verdi 
[[CAT=NOUN] [ROOT=elma] [AGR=3SG] [POSS=NONE] [CASE=ACC]] — > elmayl 
[PERIOD] — >. 



yz 



Example 3: 
Input : 

; Adam kitabi okumak istedi. 

; The direct object is a sentential clause representing an 
; indefinite act. 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "iste") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr 

((number singular) 
(person 3))))))) 
(dir-obj 
( (roles 

((role act) 
(arg 

((s-form inf-ind-act) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "oku") 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 



(sense positive))) 
(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "kitap"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite +)))))))))))))))))) 

Output : 

[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=NOUN] [ROOT=kitap] [AGR=3SG] [POSS=NONE] [CASE=ACC]] — > kitabi 
[[CAT=VERB] [ROOT=oku] [SENSE=POS] [CONV=NOUN=MAK] [TYPE=INFINITIVE] 

[AGR=3SG] [POSS=NONE] [CASE=NOM]] — > okumak 
[[CAT=VERB] [ROOT=iste] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > istedi 
[PERIOD] — > . 



Example 4: 
Input : 

; Adam kadlnin kitap okuduGunu zannetti. 

; The direct object is a sentential clause representing a fact. 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "zanned") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr 

((number singular) 
(person 3))))))) 
(dir-obj 
( (roles 

((role fact) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "oku") 
(sense positive) 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 



(tense past))) 
(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "kitap"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite -))))))) 
(subj ect 
( (referent 
((arg 

((concept "kadin"))) 
(agr ((number singular) (person 3)))))))))))))))))) 



Output : 



[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=NOUN] [ROOT=kadIn] [AGR=3SG] [POSS=NONE] [CASE=GEN]] — > kadlnin 
[[CAT=NOUN] [ROOT=kitap] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > kitap 
[[CAT=VERB] [ROOT=oku] [SENSE=POS] [CONV=NOUN=DIK] 

[AGR=3SG] [P0SS=3SG] [CASE=ACC]] — > okuduGunu 
[[CAT=VERB] [ROOT=zanned] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > zannetti 
[PERIOD] — > . 



Example 5: 
Input : 

; Adam kitabi okuyan kadlna elma verdi . 

; The goal has a sentential modifier, which is a gapped sentence. 
; The gap of the sentential clause is its theme. 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "ver") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr 

((number singular) 
(person 3))))))) 

(goal 
( (roles 

((role agent) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "oku") 



(sense positive) 
(tense past))) 
(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "kitap"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite +))))))) 
(subj ect 
( (referent 
((arg 

((concept" kadin"))) 
(agr 

((number singular) 
(person 3))))))))))))))) 

(dir-obj 
( (referent 
((arg 

((concept "elma"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite -)))))))))) 

Output : 

[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=NOUN] [ROOT=kitap] [AGR=3SG] [POSS=NONE] [CASE=ACC]] — > kitabi 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 



[[CAT=VERB] [ROOT=oku] [SENSE=POS] [CONV=ADJ=YAN]] — > okuyan 
[[CAT=NOUN] [ROOT=kadIn] [AGR=3SG] [POSS=NONE] [CASE=DAT]] — > kadlna 
[[CAT=NOUN] [ROOT=elma] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > elma 
[[CAT=VERB] [ROOT=ver] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > verdi 
[PERIOD] — >. 



yy 



Example 6: 
Input : 

; Adam kadlnin okuduGu kitabi istedi. 

; The direct object has a sentential modifier. 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "iste") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr 

((number singular) 
(person 3))))))) 
(dir-obj 
( (roles 

((role theme) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "oku") 
(sense positive) 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 



iUU 



(tense past))) 
(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "kitap"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite +))))))) 
(subj ect 
( (referent 
((arg 

((concept "kadin"))) 
(agr 

((number singular) 
(person 3)))))))))))))))))) 



Output : 



[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=NOUN] [ROOT=kadIn] [AGR=3SG] [POSS=NONE] [CASE=GEN]] — > kadlnin 
[[CAT=VERB] [ROOT=oku] [SENSE=POS] [CONV=AD J=DIK] [P0SS=3SG]] — > okuduGu 
[[CAT=NOUN] [ROOT=kitap] [AGR=3SG] [POSS=NONE] [CASE=ACC]] — > kitabi 
[[CAT=VERB] [ROOT=iste] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > istedi 
[PERIOD] — > . 



iUi 



Example 7: 
Input : 

; Adam eve gelir gelmez yattl. 

; Expression of time is a sentential clause representing an 
; adverbial. 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "yat") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr 

((number singular) 
(person 3))))))))) 
(adjuncts 
( (time 

((adv-type as-soon-as) 
(argument 
((s-form adverbial) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "gel"))) 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 



iUZ 



(arguments 
((goal 

( (referent 
((arg 

((concept "ev"))) 
(agr 

((number singular) 
(person 3)))))))))))))))) 



Output : 



[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=NOUN] [ROOT=ev] [AGR=3SG] [POSS=NONE] [CASE=DAT]] — > eve 
[[CAT=VERB] [ROOT=gel] [SENSE=POS] [TAM1=A0RIST] [AGR=3SG]] — > gelir 
[[CAT=VERB] [ROOT=gel] [SENSE=NEG] [TAM1=A0RIST] [AGR=3SG]] — > gelmez 
[[CAT=VERB] [ROOT=yat] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > yatti 
[PERIOD] — > . 



Example 8: 
Input : 

; Adam kadlnin kitabi okumak istediGini zannetti. 

; The direct object is a sentential clause representing a definite 
; act, and the direct object of the sentential clause is also a 
; sentential clause representing an indefinite act. 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "zanned") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr 

((number singular) 
(person 3))))))) 
(dir-obj 
( (roles 

((role fact) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 



((root "iste") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (dir-obj 
( (roles 

((role act) 
(arg 

((s-form inf-ind-act) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "oku") 
(sense positive))) 
(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "kitap"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

(definite +)))))))))))))) 

(subj ect 
( (referent 
((arg 

((concept "kadin"))) 
(agr 

((number singular) 
(person 3)))))))))))))))))) 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 



Output : 

[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=NOUN] [ROOT=kadIn] [AGR=3SG] [POSS=NONE] [CASE=GEN]] — > kadlnin 
[[CAT=NOUN] [ROOT=kitap] [AGR=3SG] [POSS=NONE] [CASE=ACC]] — > kitabi 
[[CAT=VERB] [ROOT=oku] [SENSE=POS] [CONV=NOUN=MAK] [TYPE=INFINITIVE] 

[AGR=3SG] [POSS=NONE] [CASE=NOM]] — > okumak 
[[CAT=VERB] [ROOT=iste] [SENSE=POS] [CONV=NOUN=DIK] 

[AGR=3SG] [P0SS=3SG] [CASE=ACC]] — > istediGini 
[[CAT=VERB] [ROOT=zanned] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > zannetti 
[PERIOD] — > . 



iUD 



Example 9: 
Input : 

; Kitap okuyan adam kadlnin susmaslnl istedi 

; The subject has a sentential modifier, the direct object is a 
; sentential clause representing a definite act. 



((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "iste") 
(sense positive) 
(tense past) 
(aspect perfect))) 
(arguments 
( (subj ect 
( (referent 
((agr 

((number singular) 
(person 3))))) 
(roles 
((role agent) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "oku") 
(sense positive) 
(tense past))) 



(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "kitap"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( (quan 

((definite -))))))) 
(subj ect 
( (referent 
((arg 

((concept "adam"))) 
(agr 

((number singular) 
(person 3))))))))))))))) 

(dir-obj 
( (roles 

((role act) 
(arg 

(s-form inf-def-act) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "sus") 
(sense positive))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "kadin"))) 
(agr 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 
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((number singular) 
(person 3))))))))))))))))) 

Output : 

[[CAT=NOUN] [ROOT=kitap] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > kitap 
[[CAT=VERB] [ROOT=oku] [SENSE=POS] [CONV=ADJ=YAN]] — > okuyan 
[[CAT=NOUN] [ROOT=adam] [AGR=3SG] [POSS=NONE] [CASE=NOM]] — > adam 
[[CAT=NOUN] [ROOT=kadIn] [AGR=3SG] [POSS=NONE] [CASE=GEN]] — > kadlnin 
[[CAT=VERB] [ROOT=sus] [SENSE=POS] [CONV=NOUN=MA] [TYPE=INFINITIVE] 

[AGR=3SG] [P0SS=3SG] [CASE=ACC]] — > susmaslnl 
[[CAT=VERB] [ROOT=iste] [SENSE=POS] [TAM1=PAST] [AGR=3SG]] — > istedi 
[PERIOD] — > . 



5.2 Future Work 

Currently our grammar can not handle discontinuous constituents and certain 
long distance scramblings. Although these are seen very rarely in Turkish text, 
our grammar can be extended to handle these. The information can also be 
planned before being sent to the generator. This planning can be done using 
syntactic cues in the source language in machine translation [7, 19] or using 
Centering Theory [20] and given versus new information [11]. 

We have designed and implemented our generator taking into consideration 
that, it can also be used for interlingual machine translation or with a strategic 
generator, as demonstrated in Figure 5.1. 



tjj^AMt'Ltj uuiruib Aiyjj rui utttj wuttt^ 
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Figure 5.1: Future work. 



Chapter 6 



Conclusions 



In this thesis, we have presented the highhghts of our work on tactical generation 
in Turkish - a free constituent order language with agglutinative word structures. 
In addition to the content information, our generator takes as input the informa- 
tion structure of the sentence (topic, focus and background) and uses these to 
select the appropriate word order. In the absence of any information structure 
constituents, our generator generates sentences in a default order. 

Our grammar uses a right-linear rule backbone which implements a (recursive) 
finite state machine for dealing with alternative word orders. The recursive be- 
haviour of this finite state machine comes from the fact that, individual adjunct 
or argument constituents can also embed sentential clauses. These sentential 
clauses are generated using the same transitions with the sentences. 

We have also provided for constituent order and stylistic variations within 
noun phrases based on certain emphasis and formality features. Our aim was 
to build a wide-coverage generator for Turkish for use in a machine translation 
application. We plan to use this generator in a prototype transfer-based human 
assisted machine translation system from English to Turkish, where the domain 
of translation is computer manuals. 

We have designed the tactical generator taking into consideration that it can 
also be used in interlingual machine translation or with a strategic generator. 
Since concepts like long distance scramblings and discontinuous constituents are 
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rarely seen in technical documents, we opted not to deal with them, but our 
grammar can be extended to cover these. 
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Appendix A 



A List of Suffixes Making 
Adverbials 



The following table lists the suffixes making temporal and manner adverbials: 



SufRx 


Roles 


Example 


+r ...+mAz 


Temporal 


Adam eve gelir gelmez uyudu. 


+dH ...+A1H 


Temporal, Manner 


Annem gitti gideli uyuyamiyorum. 


+dHkcA 


Temporal, Manner 


Kiiciik kirmizi top gittikce hizlandi. 


+ken 


Temporal, Manner 


Kitap okurken uyumusum. 


+mAdAn 


Temporal, Manner 


Sen uyumadan gideyim. 


+yHp 


Temporal, Manner 


Kizmca viiriivup gitti. 


+ArAk 


Manner 


Kosarak uzaklasti. 


+cAsInA 


Manner 


Ucrcasma disari cikti. 


+A ...+A 


Manner, Temporal 


Kosa kosa gitti. 


+yIncA 


Manner, Temporal 


Ben gelince o gitti. 


+yAll 


Temporal 


Sen geleli o uyuyor. 
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Appendix B 



Gapped Sentential Clauses 



The following tables demonstrate the verb forms of sentential clauses in different 
situations: 



Role 


Voice 


Tense 


Trans. 


Subject 


Part-Form 


Example 


Agent 


Act. 


Past/ 


Trans . 


spec. /non-spec. 


+yAn 


Qocuga elma satan adam 


Exper. 




Pres. 


Intrans. 




+yAn 


Qok uyuyan gocuk 






Put. 


Trans . 


spec. /non-spec. 


+yAcAk 


Qocuga elmayi satacak adam 








Intrans. 




+yAcAk 


Uyuyacak gocuk 






Past + 


Trans . 


spec. /non-spec. 


+mH§ 


Yemegini yemi§ gocuk 






Narr. 


Intrans. 




+mH§ 


Okumu§ gocuk 




Pass. 


Past/ 


Trans . 


spec. /non-spec. 










Pres. 


Intrans. 












Put. 


Trans . 


spec. /non-spec. 












Intrans. 












Past + 


Trans . 


spec. /non-spec. 










Narr. 


Intrans. 










Caus. 


Past/ 


Trans . 


spec. /non-spec. 


+yAn 


Qocuga yemek yediren kadm 






Pres. 


Intrans. 




+yAn 


Qocugu uyutan kadm 






Put. 


Trans . 


spec. /non-spec. 


+yAcAk 


Qocuga yemek yedirecek kadm 








Intrans. 




+yAcAk 


Qocugu uyutacak kadm 






Past + 


Trans . 


spec. /non-spec. 


+mH§ 


Qocuga yemek yedirmi§ kadm 






Narr. 


Intrans. 




+mH§ 


Qocugu uyutmu§ kadm 
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Role 


Voice 


Tense 


Trans. 


Subject 


Part-Form 


Example 


Patient 


Act. 


Past/ 


Trans 


spec . 1 uou-spec. 


-|-dHk-|-POSS 


A.dauuu okudu^u kitap 


Theme 






lutraus . 












Put. 




spec . 1 uou-spec. 


-|-yAcAk-|-POSS 


A.dauuu okuyaca^i kitap 








lutraus . 










~P^s 


Past/ 




spec . 1 uou-spec. 


-l-v Au 


A.daru tarafiudau okuuau kitap 








lutraus . 












Put. 




spec . 1 uou-spec. 


-|-yAcAk 


A.daiu tarafiudau okuuacak kitap 








lutraus. 










Caus. 


Past/ 


Traus . 


spec. /uou-spec. 


-FdHk-FPOSS 


Kadiuiu gocuga yedirdigi yeiuek 






Pres. 


lutraus. 


_ 


_ 


_ 






Put. 


Traus . 


spec. /uou-spec. 


-FyAcAk-FPOSS 


Kadiuiu gocuga yedirecegi yeiuek 








lutraus. 


_ 


_ 


_ 


Source 


A.cti ve 


Past/ 


Trans 


spec . / uou-spec. 


-|-dHk-|-POSS 


^ocuguu deuize ta§ attigi iskele 








lutraus . 




-|-dHk-|-POSS 


^ocuguu eve yurudugu okul 






Put. 




spec . / uou-spec. 


-KyAcAk-^POSS 


^ocuguu deuize ta§ atacagi iskele 








lutraus . 




-|-yAcAk-|-POSS 


^ocuguu eve yuruyecegi okul 




Passive 


Past/ 






-|-dHk-|-POSS 


Ta^iu deuize atildigi iskele 










uou-spec . 


-l-v Au 


Deuize ta§ atilau iskele 








lutraus . 




-l-v Au 


Okula yuruueu ev 






Put. 






-|-yAcAk-|-POSS 


Ta^iu deuize atilacagi iskele 










uou-spec . 


-|-yAcAk 


Deuize ta§ atilacak iskele 








lutraus. 




-KyAcAk 


Okula viiriiuecek ev 




Caus. 


Past/ 


Traus . 


spec. /uou-spec. 


-FdHk-FPOSS 


Kadiuiu deuize ta§ attirdigi iskele 






Pres. 


lutraus. 


_ 


-FdHk-FPOSS 


Kadiuiu gocugu okula yiiruttiigu ev 






Put. 


Traus . 


spec. /uou-spec. 


-FyAcAk-FPOSS 


Kadiuiu deuize ta§ attiracagi iskele 








lutraus. 


_ 


-FyAcAk-FPOSS 


Kadiuiu gocugu okula yiiriitecegi ev 


Goal 


Act. 


Past/ 


Trans 


spec . / uou-spec. 


-|-dHk-|-POSS 


^ocuguu iskeledeu ta§ attigi deuiz 








lutraus . 




-|-dHk-|-POSS 


^ocuguu okuldau yurudugu ev 






Put. 




spec . / uou-spec. 


-|-yAcAk-|-POSS 


^ocuguu iskeledeu ta§ atacagi deuiz 








lutraus . 




-|-yAcAk-|-POSS 


^ocuguu okuldau yuruyecegi ev 




~P^s 


Past/ 






-|-dHk-|-POSS 


Ta^iu iskeledeu atildigi deuiz 










uou-spec . 


-l-v Au 


Iskeledeu ta§ atilau deuiz 








lutraus . 




-l-v Au 


Evdeu yuruueu okul 






Put. 






-|-yAcAk-|-POSS 


Ta^iu iskeledeu atilacagi deuiz 










uou-spec . 


-(-yAcAk 


Iskeledeu ta§ atilacak deuiz 








lutraus. 




-|-yAcAk 


Evdeu yuruuecek okul 




Caus. 


Past/ 


Traus . 


spec. /uou-spec. 


-FdHk-FPOSS 


Kadiuiu iskeledeu ta§ attirdigi deuiz 






Pres. 


lutraus. 




-FdHk-FPOSS 


Kadiuiu gocugu evdeu yiiruttiigu okul 






Put. 


Traus . 


spec. /uou-spec. 


-FyAcAk-FPOSS 


Kadiuiu iskeledeu ta§ attiracagi deuiz 








lutraus. 




-FyAcAk-FPOSS 


Kadiuiu gocugu yiiriitecegi okul 
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Role 


Voice 


Tense 


Trans. 


Subject 


Part-Form 


Example 


Loc 


A-Ctl VG 


Past/ 


Trans 


spec . 1 non-spcc . 


-l-dHk+POSS 


^ocu^un kitap okudu^u masa 












-l-dHk+POSS 


^ocu^un uyudu^u yatak 






Put. 




spec . 1 non-spcc . 


-l-yAcAk+POSS 


^ocu^un kitap okuyaca^i masa 












-|-yAcAk-|-POSS 


^ocu^un okuldan uyuyaca^i yatak 




PclSSl VG 


Past/ 






-l-dHk+POSS 


kitabm okundu^u masa 










non-spcc . 


-l-vAn 


kitap okunan masa 












-l-v An 


uyunan yatak 






Put. 






-l-yAcAk+POSS 


kitabm okunaca^i masa 










non-spcc . 


-|-yAcAk 


kitap okunacak masa 








Intrans. 




+yAcAk 


Uyunacak yatak 




Caus. 


Past/ 


Trans . 


spec. /non-spec. 


+dHk+POSS 


Kadinm gocuga yemek yedirdigi masa 






PrGS. 


Intrans. 


_ 


+dHk+POSS 


Kadinm gocugu evden uyuttugu masa 






Put. 


Trans . 


spec. /non-spec. 


+yAcAk+POSS 


Kadinm gocuga yemek yedirecegi masa 








Intrans. 


_ 


+yAcAk+POSS 


Kadinm gocugu uyutacagi yatak 


Benef 


Actl VG 


Past/ 


Trans 


spec . / non-spec . 


+dHk+POSS 


Kadinm kitap okudugu ^ocuk 








Intrans . 












Put. 




spec . 1 non-spec . 


-l-yAcAk+POSS 


K^adinm kitap okuyacagi ^ocuk 








Intrans . 










PclSSl VG 


Past/ 






-l-dHk+POSS 


K^itabm okundugu ^ocuk 










non-spec . 


-l-v An 


K^itap okunan ^ocuk 








Intrans . 












Put. 






-|-yAcAk-|-POSS 


K^itabm okunacagi ^ocuk 










non-spec . 


-|-yAcAk 


K^itap okunacak ^ocuk 








Intrans. 










Caus. 


Past/ 


Trans . 


spec. /non-spec. 










PrGS. 


Intrans. 


_ 




_ 






Put. 


Trans . 


spec. /non-spec. 




_ 








Intrans. 


_ 




- 


C-obj 


Caus. 


Past/ 


Trans . 


spec. /non-spec. 


+dHk+POSS 


Kadinm yemek yedirdigi gocuk 






PrGS. 


Intrans. 




+dHk+POSS 


Kadinm uyuttugu gocuk 






Put. 


Trans . 


spec. /non-spec. 


+yAcAk+POSS 


Kadinm yemek yedirecegi gocuk 








Intrans. 




+yAcAk+POSS 


Kadinm uyutacagi gocuk 



Role 


Voice 


Tense 


Trans. 


Subject 


Part-Form 


Example 


Rec. 


A-Cti ve 


Past/ 


Trans 


spec . 1 non-spcc . 


-t-dHk+POSS 


K^adinm kitap verdi^i ^ocuk 




















Put. 




spec . 1 non-spcc . 


-t-yAcAk+POSS 


K^adinm kitap verece^i ^ocuk 


















Passive 


Past/ 






-t-dHk+POSS 


K^itabm verildi^i ^ocuk 










non-spcc . 


-l-vAn 


Kitap verilen ^ocuk 




















Put. 






-l-yAcAk-l-POSS 


K^itabm verilece^i ^ocuk 










non-spcc . 


-t-yAcAk 


K^itap verilecek ^ocuk 








Intrans. 










Caus. 


Past/ 


Trans . 


spec. /non-spec. 


-l-dHk-l-POSS 


Kadinin adama kitap verdirdigi gocuk 






Pres. 


Intrans. 


_ 




_ 






Put. 


Trans . 


spec. /non-spec. 


-l-yAcAk-l-POSS 


Kadinin adama kitap verdirecegi gocuk 








Intrans. 


_ 




_ 


Time 


Active 


Past/ 


Trans 


spec . / non-spec . 


-l-dHk-l-POSS 


Adaniin elmayi sattigi zaman/ sut6 


IHr 






Intrans . 




-l-dHk-l-POSS 


^ocugun uy^udugu zaman/ sutc 






Put. 




spec . / non-spec . 


-l-yAcAk-l-POSS 


A.damin elmay^i satacagi zaman/ sutc 








Intrans . 




-l-yAcAk-l-POSS 


^ocugun uy^uy^acagi zaman/ sutc 




Passive 


Past/ 






-l-dHk-l-POSS 


K^itabm okundugu zaman/ sutc 










non-spec . 


-t-v An 


K^itap okunan zaman/ sutc 








Intrans . 




-l-dHk-l-POSS 


Okula y^urundugu zaman 






Put. 






-|-yAcAk-|-POSS 


K^itabm okunacagi zaman/ sutc 










non-spec . 


-|-yAcAk 


K^itap okunacak zaman/ sutc 








Intrans. 




-l-yAcAk 


Okula yiiriinecek zaman 




Caus. 


Past/ 


Trans . 


spec. /non-spec. 


-l-dHk-l-POSS 


Kadinm yemek yedirdigi zaman/ sutc 






Pres. 


Intrans. 


_ 


-l-dHk-l-POSS 


Kadmm gocugu uyuttugu zaman/ sutc 






Put. 


Trans . 


spec. /non-spec. 


-l-yAcAk-l-POSS 


Kadinm yemek yedirecegi zaman/ sutc 








Intrans. 


_ 


-l-yAcAk-l-POSS 


Kadmm gocugu uyutacagi zaman/ sutc 


Place 


A-Cti ve 


Past/ 


Trans 


spec . / non-spec . 


-|-dHk-|-POSS 


^ocugun yemek yedigi masa 








Intrans . 




-|-dHk-|-POSS 


^ocugun yurudugu yol 






Put. 




spec . / non-spec . 


-|-yAcAk-|-POSS 


^ocugun yemek yiyecegi masa 








Intrans . 




-|-yAcAk-|-POSS 


^ocugun yuruyecegi yol 




Passive 


Past/ 






-|-dHk-|-POSS 


^^emegm yendigi masa 










non-spec . 


-t-v An 
-py ^11 


^^emek yenilen masa 








Intrans . 




-l-vAn 


Yurunen yol 






Put. 






-|-yAcAk-|-POSS 


^^emegm yenecegi masa 










non-spec . 


-|-yAcAk 


^^emek yenecek masa 








Intrans. 




-|-yAcAk 


Yurunecek yol 




Caus. 


Past/ 


Trans . 


spec. /non-spec. 


-l-dHk-l-POSS 


Kadmm gocuga yemek yedirdigi masa 






Pres. 


Intrans. 




-l-dHk-l-POSS 


Kadmm gocugu yuriittugd yol 






Put. 


Trans . 


spec. /non-spec. 


-l-yAcAk-l-POSS 


Kadmm gocuga yemek yedirecegi masa 








Intrans. 




-l-yAcAk-l-POSS 


Kadmm gocugu yiiriitecegi yol 



Appendix C 



Examples from Computer 
Manuals 

Example 1: 
Input : 

; Dosyalardaki belgelerle CallSma gOrUntUsU her gOsterildiGinde , 
; dosya bilgi isteminde, en son kullandlGInIz dosyanin adi yer allr 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "yer-al") 
(sense positive) 
(tense present) 
(aspect aorist))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "ad"))) 
(agr 

((number singular) 
(person 3))))) 
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(specifier 
( ( quan 

((definite +))))) 
(possessor 
( (roles 

((role theme) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "kullan") 
(sense positive) 
(tense past))) 
(arguments 
( (subj ect 
( (referent 
((agr 

((number plural) 
(person 2))))))) 
(dir-obj 
( (referent 
((arg 

((concept "dosya"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( ( quan 

((definite +))))))))) 
(adjuncts 
( (time 

((p-name "son") 
(intensif ier 
((degree "en"))))))))))))))))) 

(control 
((is 

((focus subject))))) 
(adjuncts 
( (place 

( (referent 



LZZ 



((arg 

( (referent 
((arg 

((concept "istem"))) 
(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "bilgi"))) 
(agr 

((number singular) 
(person 3))))))))) 

(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "dosya"))) 
(agr 

((number singular) 
(person 3))))))))) 

(time 
( (roles 

((role fact) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice passive) 
(speech-act declarative) 
(verb 
((root "gOster") 
(sense positive) 
(tense past))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "gOrUntU"))) 
(agr 
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((number singular) 
(person 3))))) 
(specifier 
( ( quan 

((definite -))))) 
(classifier 
( (referent 
( (roles 

((role act) 
(arg 

((s-form inf-def-act) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "CallS") 
(sense positive))) 
(arguments 
( (instrument 
( (referent 
((arg 

((concept "beige"))) 
(agr 

((number plural) 
(person 3))))) 
(specifier 
( (spec-rel 

((relation location) 
(argument 
( (referent 
((arg 

((concept "dosya"))) 
(agr 

((number plural) 
(person 3))) 
(sem ((temporal -)))))))))))))))) 
))))))))))) 

(adjuncts 
( (manner 

( (quantifier 

((root her)))))))))))))))) 



Output : 

[ [CAT=IOOT] [ROOT=dosya] [AGR=3PL] [POSS=IOTE] [CASE=LOC] 

[COIV=ADJ=REL]] — > dosyalardaki 
[ [CAT=IOOT] [ROOT=belge] [AGR=3PL] [POSS=IOTE] [CASE=IIS] ] — > belgelerle 
[[CAT=VERB] [ROOT=CalIS] [SEISE=POS] [COIV=IOOT=MA] [TYPE=IIFIIITIVE] 

[AGR=3SG] [POSS=IOIE] [CASE=IOM] ] — > CallSma 
[ [CAT=IOOT] [ROOT=gOrUntU] [AGR=3SG] [P0SS=3SG] [CASE=IOM] ] — > gOrUntUsU 
[[CAT=ADJ] [ROOT=her] [TYPE=DETERMIIER]] — > her 

[[CAT=VERB] [ROOT=gOster] [VOICE=PASS] [SEISE=POS] [COIV=IOOT=DIK] 

[AGR=3SG] [P0SS=3SG] [CASE=LOC]] — > gOsterildiGinde 
[ [CAT=IOOT] [ROOT=dosya] [AGR=3SG] [POSS=IOTE] [CASE=IOM] ] — > dosya 
[ [CAT=IOOT] [ROOT=bilgi] [AGR=3SG] [POSS=IOTE] [CASE=IOM] ] — > bilgi 
[ [CAT=IOOT] [ROOT=istem] [AGR=3SG] [P0SS=3SG] [CASE=LOC]] — > isteminde 
[[CAT=ADVERB] [ROOT=en]] — > en 
[[CAT=ADJ] [ROOT=son]] — > son 

[[CAT=VERB] [ROOT=kullan] [SEISE=POS] [COIV=AD J=DIK] [P0SS=2PL]] 
— > kullandlGInIz 

[ [CAT=IOOT] [ROOT=dosya] [AGR=3SG] [POSS=IOTE] [CASE=GEI] ] — > dosyanin 
[[CAT=IOOT] [ROOT=ad] [AGR=3SG] [P0SS=3SG] [CASE=IOM]] — > adi 
[[CAT=VERB] [ROOT=yer-al] [SEISE=POS] [TAM1=A0RIST] [AGR=3SG]] — > yer-allr 
[PERIOD] — > . 



Example 2: 
Input : 

; kullanim tanltim alanlnda beige biCimini tanlmlayan metin 
; tanltlmlnin adI yer allr 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "yer-al") 

(sense positive) 

(tense present) 

(aspect aorist))) 
(control 



((is 

((focus subject))))) 
(adjuncts 
( (place 

( (referent 
((arg 

( (referent 
((arg 

((concept "alan"))) 
(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "tanltim"))) 
(agr 

((number singular) 
(person 3))))))))) 

(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "kullanim") ) ) 
(agr 

((number singular) 
(person 3))))))))))) 

(arguments 
( (subj ect 
( (referent 
((arg 

((concept "ad"))) 
(agr 

((number singular) 
(person 3))))) 
(possessor 
( (roles 

((role agent) 
(arg 

((s-form participle) 



(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "tanlmla") 
(sense positive) 
(tense past) ) ) 
(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "biCim"))) 
(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "beige"))) 
(agr 

((number singular) 
(person 3))))))))) 

(subj ect 
( (referent 
((arg 

((concept "tanltim"))) 
(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "metin"))) 
(agr 

((number singular) 
(person 3)))))))))))))))))))))) 

Output : 

[ [CAT=IOOT] [ROOT=kullanIm] [AGR=3SG] [POSS=IOIE] [CASE=IOM] ] — > kullanim 
[ [CAT=IOOT] [ROOT=tanItIm] [AGR=3SG] [POSS=IOIE] [CASE=IOM] ] — > tanltim 
[ [CAT=IOOT] [ROOT=alan] [AGR=3SG] [P0SS=3SG] [CASE=LOC]] — > alanlnda 



[ [CAT=IOOT] [ROOT=belge] [AGR=3SG] [POSS=IOTE] [CASE=IOM] ] — > beige 

[ [CAT=IOOT] [ROOT=biCim] [AGR=3SG] [P0SS=3SG] [CASE=ACC]] — > biCimini 

[[CAT=VERB] [ROOT=tanImla] [SEISE=POS] [COIV=AD J=YAI] ] — > tanlmlayan 

[ [CAT=IOOT] [ROOT=metin] [AGR=3SG] [POSS=IOTE] [CASE=IOM] ] — > met in 

[ [CAT=IOOT] [ROOT=tanItIm] [AGR=3SG] [P0SS=3SG] [CASE=GEI] ] — > tanltlmlnin 

[[CAT=IOOT] [ROOT=ad] [AGR=3SG] [P0SS=3SG] [CASE=IOM]] — > adi 

[[CAT=VERB] [ROOT=yer-al] [SEISE=POS] [TAM1=A0RIST] [AGR=3SG]] — > yer-allr 

[PERIOD] — > . 



Example 3: 



Input : 

; Metni yazarken kullanabileceGiniz iSlev tuSlarlnl iCeren 

; listenin gOrUntUnUn alt bOlUmUnde gOsterilmesini seCebilirsiniz 

((s-form finite) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "seC") 
(sense positive) 
(modality yabil) 
(tense present) 
(aspect aorist))) 
(arguments 
( (subj ect 
( (referent 
((agr 

((number plural) 
(person 2))))))) 
(dir-obj 
( (roles 

((role act) 
(arg 

((s-form inf-def-act) 
(clause-type predicative) 
(voice passive) 
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(speech-act declarative) 
(verb 
((root "gOster") 
(sense positive))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "liste"))) 
(agr 

((number singular) 
(person 3))))) 
(roles 
((role agent) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "iCer") 
(sense positive) 
(tense past) ) ) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "liste"))) 
(agr 

((number singular) 
(person 3))))))) 
(dir-obj 
( (roles 

((role theme) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "kullan") 
(sense positive) 



(modality yabil) 
(tense future))) 
(arguments 
( (dir-obj 
( (referent 
((arg 

((concept "tuS"))) 
(agr 

((number plural) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "iSlev"))) 
(agr 

((number singular) 
(person 3))))))) 
(specifier 
( ( quan 

((definite +))))))) 
(subj ect 
( (referent 
((agr 

((number plural) 
(person 2))))))))) 
(adjuncts 
( (time 

((adv-type while) 
(argument 
((s-form adverbial) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "yaz") 
(sense positive) 
(tense present) 
(aspect aorist))) 
(arguments 
( (dir-obj 
( (referent 
((arg 



((concept "metin"))) 
(agr 

((number singular) 
(person 3))))) 
(specifier 
( ( quan 

((definite +))))))))))))) 

)))))))))))))))) 

(location 
( (referent 
((arg 

((concept "bOlUm"))) 
(agr 

((number singular) 
(person 3))))) 
(modifier 
( (qualitative 

((p-name "alt"))))) 
(possessor 
( (referent 
((arg 

((concept "gOrUntU"))) 
(agr 

((number singular) 
(person 3)))))))))))))))))))) 

Output : 

[ [CAT=IOOT] [ROOT=metin] [AGR=3SG] [POSS=IOIE] [CASE=ACC]] — > metni 
[[CAT=VERB] [ROOT=yaz] [SEISE=POS] [TAM1=A0RIST] [COTV=ADVERB=KEI] ] 
— > yazarken 

[[CAT=VERB] [ROOT=kullan] [SEISE=POS] [COMP=YABIL] [COIV=AD J=YACAK] 

[P0SS=2PL]] — > kullanabileceGiniz 
[ [CAT=IOOT] [ROOT=iSlev] [AGR=3SG] [POSS=IOIE] [CASE=IOM] ] — > iSlev 
[ [CAT=IOOT] [ROOT=tuS] [AGR=3PL] [P0SS=3SG] [CASE=ACC]] — > tuSlarlnl 
[[CAT=VERB] [ROOT=iCer] [SEISE=POS] [COIV=ADJ=YAI]] — > iCeren 
[ [CAT=IOOT] [ROOT=liste] [AGR=3SG] [POSS=IOIE] [CASE=GEI] ] — > listenin 
[ [CAT=IOOT] [ROOT=gOrUntU] [AGR=3SG] [POSS=IOIE] [CASE=GEI] ] 
— > gOrUntUnUn 

[[CAT=ADJ] [ROOT=alt]] — > alt 

[ [CAT=IOOT] [R00T=b01Um] [AGR=3SG] [P0SS=3SG] [CASE=LOC]] — > bOlUmUnde 
[[CAT=VERB] [ROOT=gOster] [VOICE=PASS] [SEISE=POS] [COIV=IOOT=MA] 
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[TYPE=IIFIIITIVE] [AGR=3SG] [P0SS=3SG] [CASE=ACC]] — > gOsterilmesini 
[[CAT=VERB] [ROOT=seC] [SEISE=POS] [COMP=YABIL] [TAM1=A0RIST] [AGR=2PL]] 
— > seCebilirsiniz 
[PERIOD] — > . 



Example4: 
Input : 

; gOrUntUdeki ikinci satir, yUrUrlUkteki kenar boSluGu ve sekme 
; noktasi ayarlarlnl gOsteren OlCek satlrldir 

((s-form finite) 
(clause-type attributive) 
(rel is-a) 
(voice active) 
(speech-act declarative) 
(verb 
((root to-be) 
(sense positive) 
(tense present) 
(aspect aorist))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "satIr"))) 
(agr 

((number singular) 
(person 3))))) 
(modifier 
( (ordinal 

((order 2))))) 
(specifier 
( (spec-rel 

((relation location) 
(argument 
( (referent 
((arg 

((concept "gOrUntU"))) 
(agr 
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((number singular) 
(person 3))) 
(sem ((temporal -))))))))))))) 
(pred-property 
( (roles 

((role agent) 
(arg 

((s-form participle) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "gOster") 
(sense positive) 
(tense past))) 
(arguments 
( (subj ect 
( (referent 
((arg 

((concept "satir"))) 
(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "OlCek"))) 
(agr 

((number singular) 
(person 3))))))))) 

(dir-obj 
((conj ve) 
(elements 
(*multiple* 
( (referent 
((arg 

((concept "boSluk"))) 
(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
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((arg 

((concept "kenar"))) 
(agr 

((number singular) 
(person 3))))))) 
(specifier 
( (spec-rel 

((relation location) 
(argument 
( (referent 
((arg 

((concept "yUrUrlUk"))) 
(agr 

((number singular) 
(person 3))) 
(sem ((temporal -)))))))))))) 
( (referent 
((arg 

((concept "ayar"))) 
(agr 

((number plural) 
(person 3))))) 
(classifier 
( (referent 
((arg 

((concept "nokta"))) 
(agr 

((number singular) 
(person 3))))) 
(classifier 
( (referent 
( (roles 

((role act) 
(arg 

((s-form inf-def-act) 
(clause-type predicative) 
(voice active) 
(speech-act declarative) 
(verb 
((root "sek") 
(sense positive))))))))))))) 

(specifier 
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( ( quan 

((definite +))))))))) 

))))))))))))) 

Output : 

[ [CAT=IOOT] [ROOT=gOrUntU] [AGR=3SG] [POSS=IOIE] [CASE=LOC] 

[COIV=ADJ=REL]] — > gOrUntUdeki 
[[CAT=ADJ] [ROOT=ikinci] [TYPE=ORDIML]] — > ikinci 
[ [CAT=IOOT] [ROOT=satIr] [AGR=3SG] [POSS=IOTE] [CASE=IOM] ] — > satir 
[ [CAT=IOOT] [ROOT=yUrUrlUk] [AGR=3SG] [POSS=IOIE] [CASE=LOC] 

[COIV=ADJ=REL]] — > yUrUrlUkteki 
[ [CAT=IOOT] [ROOT=kenar] [AGR=3SG] [POSS=IOTE] [CASE=IOM] ] — > kenar 
[ [CAT=IOOT] [ROOT=boSluk] [AGR=3SG] [P0SS=3SG] [CASE=IOM] ] — > boSluGu 
[[CAT=COm] [ROOT=ve]] — > ve 

[[CAT=VERB] [ROOT=sek] [SEISE=POS] [COTV=IOOT=MA] [TYPE=IIFIIITIVE] 

[AGR=3SG] [POSS=IOIE] [CASE=IOM] ] — > sekme 
[ [CAT=IOOT] [ROOT=nokta] [AGR=3SG] [P0SS=3SG] [CASE=IOM] ] — > noktasi 
[ [CAT=IOOT] [ROOT=ayar] [AGR=3PL] [P0SS=3SG] [CASE=ACC]] — > ayarlarlnl 
[[CAT=VERB] [ROOT=gOster] [SEISE=POS] [COIV=ADJ=YAI]] — > gOrteren 
[ [CAT=IOOT] [R00T=01Cek] [AGR=3SG] [POSS=IOTE] [CASE=IOM] ] — > OlCek 
[ [CAT=IOOT] [ROOT=satIr] [AGR=3SG] [P0SS=3SG] [CASE=IOM] 

[COIV=VERB=IOIE] [TAM2=PRES] [C0PULA=2] [AGR=3SG] ] — > satlrldir 
[PERIOD] — > . 



